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0© (54) Title: XYLOSYLTRANSFERASE AND ISOFORMS THEREOF 

(57) Abstract: The invention relates to the isolation, purification and characterization of the enzyme xylosyltransferase (defined 
^ as "XT'). The invention describes for the first time that XT occurs in at least two isoforms ("XT-I", "XT-IT). The invention re- 
lates furthermore to the recombinant cloning and expression of human and rat XT-I and XT-II and discloses their DNA and protein 
Q sequences. The enzymes according to the invention can be used as therapeutic agents and as diagnostic markers, e.g. for the de- 
^ termination of enhanced proteoglycan biosynthesis, and as biochemical markers for determination of several pathological processes 
^ such as systemic sclerosis. 
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Xylosyltransferase and Isoforms thereof 



The invention relates to the isolation, purification and characterization of the enzyme 
xylosyltransferase (defined as "XT"). The invention describes for the first time that XT occurs 

5 in at least two isoforms ("XT-I", "XT-II"). The invention relates furthermore to the recombinant 
cloning and expression of human and rat XT-I and XT-II and discloses their DNA and protein 
sequences. The enzymes according to the invention can be used as therapeutic agents and as 
diagnostic markers, e.g. for the determination of enhanced proteoglycyn biosynthesis, and as 
biochemical markers for determination of several pathological processes such as systemic 

10 sclerosis. 

Background of the Invention 

Proteoglycans are polyanionic molecules widely expressed in animal cells and virtually every 
tissue. These abundant molecules are present in the extracellular matrix and on the cell surface 

15 and serve a wide range of functions. They are increasingly implicated as important regulators in 
many biological processes, such as extracellular matrix deposition, cell membrane signal 
transfer, morphogenesis, cell migration, normal and tumor cell growth and viral infection 
(Ruoslahti, 1989, J. Biol Chem. 264, 13369-1372; Heroldtf a/., 1994,7. Gen. Virol 75, 1211- 
1222). Proteoglycans mediate diverse cellular processes through interaction with a variety of 

20 protein ligands. In most of these bindings electrostatic interactions with the glycosaminoglycan 
chains attached to the core protein are involved (Kjellen & Lindahl, 1991, Annu. Rev. Biochem. 
60, 443-475). Thus, the biological activity of proteoglycans is intimately related to the 
glycosaminoglycan biosynthesis. 

25 The sulfated glycosaminoglycans chondroitin sulfate, heparan sulfate, heparin and dermatan 
sulfate are bound to the proteoglycan core protein by a xylose-galactose-galactose binding 
region (Kjellen & Lindahl, 1991, lc). UDP-D-xylose:proteogiycan core protein (5-D- 
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various species {Hoffmann et al, 1984, Connect Tissue. Res. 12, 151-164), and it was shown 
that the enzyme is secreted from the endoplasmatic reticulum into the extracellular space 
together with chondroitin sulfate proteoglycans (KMhnert et al., 1991, Eur. 1 Clin. Chem. Clin. 
Biochem. 29, 624-625; Gotting et al., 1999, J. Invest. Dermatol 112 r 919-924). However, the 
5 processes resulting in the release of XT from the endoplasmatic reticulum or the Golgi 
compartments and the role of XT in the extracellular matrix are not yet known . 

As XT is the initial step enzyme in the biosynthesis of the glycosaminoglycan linkage region 
and as it is secreted into the extracellular matrix to a great extent, XT activity was proposed to 

10 be a diagnostic marker for the determination of an enhanced proteoglycan biosynthesis and of 
tissue destruction (Weilke et al., 1997, Clin. Chem. 43, 45-51). XT activities in the synovial 
fluid were found to be significantly increased in chronic inflammatory joint diseases (Kleesiek 
et al., 1987, J. Clin. Chem. Clin Biochem. 25, 473-481). Recent studies have shown that serum 
XT activity is a biochemical marker for the determination of fibrotic activity in systemic 

15 sclerosis (Gotting et ai.,1999, lc; Gotting et al., 2000, Acta Derm. Venereol 80, 60-61.). 

Up to now there was no success to isolate or characterize xylosyltransferase (XT), however 
some methods were described to measure and determine the activity of said enzyme from blood 
or body fluid samples of patients showing pathological effects such as scleroderma or chronic 

20 joint diseases ^Stoolmiller, 1972, J. Biol Chem, 247, 3525-3532). The samples were incubated 
with UDP-[ l4 C]xylose and an appropiate acceptor. The incorporated radioactivity indicated the 
amount of XT activity. Acceptors used so far are proteoglycans, silk fibroin (Campbell et al., 
1984, Anal Biochem. 137, 505-516) and several peptides (Bourdon et al., 1987, Proc. Natl 
Acad. Sci. USA 84, 3194-3198). However, the all activity tests used herein did not allow precise 

25 determination of the lower XT activity in serum. A more specific acceptor protein is 

recombinant bikunin, the inhibitory component of human inter-a-trypsin inhibitor. Bikunin 
carries a single chondroitin, which is essential for the structure of the inhibitor. The chondroitin 
sulfate attachement site in the N-tenninal region contains all elements responsible for 
recognition by XT. The complete recognition sequence is composed of the amino acids a-a-a-a- 

30 G-S-G-a-b-a, with a = E or D and b = G, E or D. This sequence was confirmed by 

determination of the Michaelis-Menten (K m ) constants for in vitro xylosylation of different 
synthetic proteins and peptides using an enriched XT preparation from conditioned cell culture 
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supernatant of human chondrocytes. The constant was determined to be 22 \xM 9 which was 
decreased 9-fold in comparison to deglycosylated core protein from bovine cartilage (188 jiM) 
(Brinkmann et al., 1997, J. Biol Chem. 272, 11171-11175). With recombinant bikunin as 
acceptor, a sensitive assay was developed that allows precise determination of XT activity in 
5 human serum and other body fluids (see: Weilke et al., 1997, I.e.). Using this assay an increased 
xylosyltransferase activity was determined in blood of patients with sclerotic processes of 
scleroderma, closely related to an elevated proteoglycan biosynthesis (GOtting et al., 1999, l.cj. 

The biosynthesis of glycosaminoglycans requires the coordinated action of a large number of 
10 glycosyltransferases. Isolation and cloning of these glycosyltransferases has been targeted for a 
long time, since the majority of these enzymes are only present in minute amounts. The 
structure and sequence of the glycosyltransferases involved in biosynthesis of the common 
glycosaminoglycan-protein linkage region has long remained unknown. Recent cDNA cloning 
of galactosyltransferase I (Okajima et al., 1999, 1 Biol Chem. 274, 22915-22918) and 
15 glucuruonyltransferase I (Kitagawa et al., 1998, J. Biol Chem. 273, 6615-6618) identified 2 of 
the at least 4 enzymes involved in synthesis of the GlcApi-3Gaipi-3Gaipi-4Xyipi-0-Ser 
structure. 

Isolation, purification and characterization of XT involved in biosynthesis of the common 
20 carbohydrate-protein linkage structure has been hampered by difficulties in obtaining a 
sufficient amount of the source material. Since, as above mentioned, XT can be used as an 
additional biochemical marker for the determination of sclerotic activity in systemic sclerosis 
and some inflammatory disorders, it is a real need for an isolated, highly purified XT which can 
be produced by recombinant methods, which is necessary for diagnostic and therapeutic 
25 purposes. Moreover, the knowledge of the cDNA sequence of XT allows to use it on gene level 
such as in gene diagnostic or gene therapy. 

Definitions 

• Above and below the term ,! XT n means xylosyltransferase (UDP-D-xylose:proteoglycan 
30 core protein p-D-xylosyltransferase (EC 2.4.2.26) deriving from any origin and includes all 
possible isoforms, if not otherwise pointed out. The term "hXT" has the meaning of human 
XT; the term M rXT" of rat XT. The terms "XT-I" , "XT-IT, "hXT-I", "hXT-IT, n rXT-I2, 
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"rXT-IT mean the specific isoforms 1 and 2 of XT according to the invention, wherein h 
and r have the indicated meanings. 

• Above and below the term "protein" means a protein, a protein fragment or a peptide, if not 
otherwise explained. 

5 • Above and below the term "recombinant protein" is defined as a protein which was 
produced by recombinant and biotechnological methods. 

• Above and below the term "XT" or "XT protein(s)" has the meaning of (a) protein(s) 
deriving from any source, if not otherwise stated out, having the biological activity and / or 
function of UDP-D-xylose proteoglycan core protein fl-D-xylosyl transferase. 

10 • Above and below the term "a protein having the biological activity of UDP-D- 

xylose:proteoglycan core protein B-D-xylosyltransferase " is defined as a protein which has 
XT - identical or XT - like functions and activities and comprises XT itself and possible 
mutations, variants, isoforms thereof, including insertions, deletions and substitutions of one 
ore more amino acids. The term includes also fragments or longer forms of XT as well as 

15 dimeric or multimeric forms thereof showing XT functions. 

• Above and below the term "isoform" means a second naturally occurring enzyme having the 
same biological activity as a first naturally occurring enzyme, however differing by another 
amino acid sequence. 

20 Summary of the Invention 

It was found that XT can be purified 4,700-fold with 1% yield from serum-free JAR 
choriocarcinoma cell culture supernatant. The isolation procedure includes a combination of 
ammonium sulfate precipitation, heparin affinity chromatography > ion exchange 
chromatography and protamine affinity chromatography. Amongst other proteins an unknown 

25 protein was detected by matrix-assisted laser desorption ionization mass spectrometry-time of 
flight analysis (MALDI-TOF) in the purified sample. The molecular weight of this isolated and 
purified XT protein was determined as 120.000 by SDS-polyacrylamide gel electrophoresis. 
The isolated protein was enzymatically cleaved by trypsin and endoproteinase Lys-C. Peptide 
fragments were sequenced by Edman degradation. Searches with the amino acid sequences in 

30 protein and EST databases showed no homology to known sequences. XT was enriched by 
immunoaffinity chromatography with an immobilized antibody against a synthetic peptide 
deduced from the sequenced peptide fragments and was specifically eluted with the antigen. In 
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addition, XT was purified alternatively with an aprotinin affinity chromatography and was 
detected by western blot analysis in the enryme-containing fraction. 

Based on the partial amino acid sequence of their isolated and purified new enzyme (XT) 

5 derived from human JAR choriocarcinoma cell culture supernatant a novel cDNA was isolated 
according to the invention, encoding human XT-I enzyme using the degenerate reverse 
transcriptase-polymerase chain reaction method. The enzyme belongs to a novel family of 
glycosyltransferases having no homology to proteins of prior art 5'- and 3 '-RACE were 
performed to isolate a novel cDNA fragment of 3726 bp with a single open reading frame 

10 encoding at least 827 amino acids with a molecular mass of 91 .000. The human XT-I gene was 
located on chromosome 16pl3.1 using radiation hybrid mapping, and extracts from CHO-K1 
cells transfected with the XT-I cDNA in an expression vector exhibited marked XT activity. 
Furthermore, a new 3608-bp cDNA fragment encoding a novel protein of 865 amino acids was 
also isolated by PCR using degenerate primers based on the amino acid sequence of human XT- 

15 I. The amino acid sequence of this XT-II isoform displays 55% identity to the human XT-I. The 
XT-II gene is located on chromosome 17q21.3-17q22, and the exon/intron structure of the 15 
kb gene was determined. RT-PCR analyses of XT-I and XT-II mRNA from various tissues 
confirmed that both XT-I and XT-II transcripts are ubiquitously expressed in the human tissues, 
although with different levels of transcription. Furthermore, the cDNAs encoding XT-I and XT- 

20 II from rat were cloned. The deduced amino acid sequences of rat xylosyltransferases displayed 
94% identity to the corresponding human enzyme. 

Therefore, it is an object of this invention to provide the following subject-matters: 

• An isoform of UDP-D-xylose:proteoglycan core protein J$~D-xylosyltransferase(XT)\ 

25 • a protein comprising a sequence of said isoform or a fragment thereof, having the biological 
activity of XT; 

• a corresponding protein deriving from human or rat sources (hXT, rXT); 

• a corresponding protein isolated from specific human tissue, wherein said hXT has a 
molecular weight of 120.000 under SDS PAGE conditions; 

30 • a corresponding recombinant protein, wherein said protein is hXT-I comprising at least 827 
amino acids and having the amino acid sequence as depicted in Fig, 7B; 
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• a corresponding isoform of hXT-I 5 termed as hXT-II, comprising 865 amino acids, 
exhibiting approximately 55% overall sequence identity to human hXT-I, and having, in 
more detail, the amino acid sequence as depicted in Fig. 8B. 

• a process for isolating and purifying a protein having the biological activity of human UDP- 
5 D-xylose:proteoglycan core protein B-D-xylosyltransferase (EC 2.4.2.26), said process is 

characterized by the following steps: 

(i) culturing from human tissue showing an enhanced XT activity, preferably 
from JAR choriocarcinoma cells (ATCC HTB-144), and harvesting the cell 

4 

culture supernatant, 

l o (ii) fractionated ammonium sulfate precipitation of the supernatant of step (i), 

(iii) heparin affinity chromatography of the precipitate of step (ii), 

(iv) ion exchange chromatography of the step (iii) product, and 

(v) affinity chromatography of the step (iv) product, and optionally 
(vi a SDS-Polyacrylamide gel elektrophoresis of step (v); 

15 • recombinant forms of hXT and rXT, the corresponding DNA sequences (Fig. 7 A, 8 A, 9, 10) 
included as well as suitable expression vectors and expression host cell systems; 

• antibodies directed against any of the above or below mentioned XT proteins and their uses 
in immunological assays and diagnostic tools for determining said XT proteins; 

• pharmaceutical compositions comprising a XT protein as defined above and below, 
20 optionally together with a suitable pharmacologically acceptable carrier, diluent or 

excipient; 

• uses of said XT proteins for the manufacture of a medicament for the treatment of XT 
relevant diseases and disorders, wherein the xylosyltransferase enzymes according to the 
invention can either be used directly as therapeutic drug in pathological situations where a 

25 deficiency of said enzyme and its isoforms can be detected; 

• uses of said XT enzymes for the manufacture of a medicament for the treatment of diseases 
and disorders which are caused or accompanied by increased levels of said enzymes, e.g. in 
sclerotic diseases and chronic joint diseases, wherein said medicament is an inhibitor or 
antagonist of said XT proteins; 

30 • uses of said XT proteins as diagnostic markers for the diagnosis of above- and below- 
mentioned diseases and pathological symptoms and uses of said DNA molecules as gene 
markers; 
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• methods of screening for compounds which are capable to inhibit the activity of said XT 
proteins according to the invention using known methods for determining the activity of 
XT. 

5 Detailed Description: 

General 

UDP-D-xylose: proteoglycan core protein fi-D-xylosyltransferase can be isolated and purified 
from JAR choriocarcinoma cell culture. The isolated protein according to the invention is a 
single stranded polypeptide with a molecular weight of 120.000. The protein was enzymatically 

10 cleaved and eleven peptide fragments were sequenced by Edman degradation (see Examples). 
XT is present only in very small amounts in animal tissues but unlike other 
glycosyltransferases, more than 90% of XT is enriched in the medium of cultured cells. The 
highest secretion of XT activity was measured in JAR choriocarcinoma cell culture, in which 
sternal cartilage chondrocytes and 21 different human cell lines were examined. To produce a 

15 highly enriched XT solution for the isolation of XT, JAR choriocarcinoma cells were adapted to 
hollow fiber culture conditions using a novel bioreactor (TECNOMOUSE) and Ultradoma-PF 
medium without serum addition as nutrient. For purification of XT a combination of classic 
separation methods and new affinity matrices was employed. Therefore, a heparin matrix was 
used as an affinity ligand for the XT. When applied to immobilized heparin, XT was 

20 completely adsorbed at the matrix and the XT activity was eluted only with a high salt 

concentration after most contaminating proteins have been removed from the matrix. Protamine 
chloride is well-known as cationic activator for several sulfotransferases, so the effect of 
protamine chloride on the XT was investigated. An increased XT activity was measured when 
protamine chloride was added to the XT assay solution (see Examples), indicating an 

25 interaction of these arginine rich proteins with XT. Therefore, a protamine chloride affinity 
matrix was synthesized using an aldehyde activated perfusion medium as support. The 
interaction of XT with this affinity matrix resulted in a 13.6 times enrichment of XT. The 
protamine chloride affinity chromatography was the most efficient purification step during the 
isolation of the XT. Immobilized aprotinin, a Kunitz-type proteinase inhibitor, was found to be 

30 another appropriate affinity matrix for enrichment of XT, as it was able to adsorb XT 
quantitatively. Therefore, it was used for alternative purification of XT. Different lines of 
evidence showed that the isolated protein corresponds to the XT: (a) XT activity was enriched 
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using immobilized antibodies raised against a synthetic peptide deduced from the 120.000 
protein, and the XT activity could be competitively eluted with this peptide, (b) Immunoblot 
analysis of aprotinin affinity purified XT corresponds with the 120.000 protein. However, non- 
reducing and non-denaturating gel filtration chromatography with heparin affinity-purified XT 

5 from JAR cell culture supernatant shows an additional peak of XT activity at a molecular 
weight of approximately 500.000. XT is associated with proteoglycans. Treatment of XT with 
N-glycosidase F resulted in a decrease of the molecular weight from 120.000 to 1 16.000, 
suggesting that the XT is a glycoprotein. A comparison of the molecular mass of XT with other 
glycosyltransferases involved in biosynthesis of proteoglycans shows that the XT is larger than 

10 the other enzymes. Another difference is that nearly all proteoglycan glycosyltransferases are 
tightly bound to the membrane of the endoplasmic reticulum, whereas XT is secreted into the 
extracellular space. XT according to the invention contains like many glycosyltransferases a 
DxD motif, suggesting that this motif is involved in binding the metal-ion cofactor and the 
donator substrate (Gastinel et al., 1999, EMBOJ. 18, 3546-3557). A DxD sequence was also 

15 found in peptide 8 obtained from the enzymatically cleaved XT. 

The invention discloses for the first time the molecular cloning and expression of human and 
rat XT. Surprisingly, XT is found in at least two isomeric forms, which are termed according to 
the invention as XT-I and XT-II. Based on the amino acid sequence a novel cDNA was cloned, 

20 which encodes a protein of at least 827 amino acids with a molecular mass of 91 .000. A DxD 
motif was identified in the XT-I amino acid sequence using hydrophobic cluster analysis. This 
motif has been observed in many glycosyltransferases, and is involved in the coordination of a 
divalent cation in the binding of the nucleotide-sugar (Breton & Imberty, 1999, Curr. Opin. 
Struct. Biol. 9, 563-571 .). The translation initiation codon of the XT-I cDNA could not be yet 

25 cloned probably due to strong secondary structures in the 5 '-region of the XT-I mRNA. The 
XT-I protein isolated from human JAR choriocarcinoma cell culture supernatant migrated on 
SDS-PAGE with a molecular mass of 120.000 and the protein size could be further reduced 
after N-glycanase digestion. These findings indicate that the cloned cDNA represents at least 
90% of the coding region of human XT-I. 

30 

Another cDNA was completely identified from human placenta RNA, which was similar but 
not identical to the hXT-I cDNA. The new cDNA encodes a protein of 865 amino acids with a 
molecular weight of 97.000. This novel protein termed XT-I has a proline-rich region located 
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near the ammo-terminus and the type II transmembrane topology characteristic of many other 
glycosyltransferases cloned to date. The hXT-II protein was similar to the human XT-I with an 
overall sequence identity of 55%. The similarity of the predicted amino acid sequences was low 
(<10%) at the amino-tenninal end. However, large stretches at the C-terminal region, where the 
5 catalytic domain was found to be located in glycosyltransferases, are very conserved with an 
identity of more than 80% in both proteins. These findings let conclude that the XT-II gene 
encodes another human xylosyltransferase, although the catalytic activity and the biological 
role of XT-II remain to be elucidated in detail. 

10 Since alterations in XT activity have been reported to be associated with fibrotic and sclerotic 
alterations of connective tissue (Getting et al 9 1999; Getting et aL, 2000, I.e.), the present 
findings provide molecular tools to study the function and the regulated expression of human 
XT as well as the molecular mechanisms of these diseases. 

15 Production of XT 

JAR choriocarcinoma cells which had been adapted to growth in the serum-free Ultradoma-PF 
medium secreted XT activity into the cell culture supernatant. During the exponential growth 
XT activity and total protein concentration in the supernatant of a traditional cell culture system 
(T-flasks) were determined as 0.2 mU/1 and 0.1 g/1, respectively. The cells were cultivated 

20 using three "Tecnomouse"® bioreactor systems. Each bioreactor contained five culture 

casettes. About 10 7 cells per culture casette were inoculated and medium probes of about 0,5 ml 
were taken every day to determine the viability of the cells and the glucose, and lactate 
concentration as well as the XT activity of the cell culture supernatant The cells amounted in 
the probes varies from 10 5 to 10 7 cells per ml with viability between 40 and 70%. XT 

25 production increased within three weeks after cell inoculation, reaching a plateau of 

approximately 30 mU/1. After three months the culture cassette was removed. Harvesting was 
carried out every two days, collecting 10 ml cell culture supernatant per culture cassette. The 
cell-free supernatant was collected and stored at -75°C. Mean XT activity in the supernatant of 
the high density culture was determined as 29.0 mU/1, while the total protein concentration was 

30 estimated as 4.8 g/l. In total 18.51 high density cell culture supernatant was collected and 
yielded 535.8 mU XT. 
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Isolation and purification of XT from cell culture supernatant 

XT was purified form 18.5 1 supernatant (equivalent to 2.000 1 normal cell culture supernatant) 
of serum-free cultivated JAR choriocarcinoma cells to an apparent homogeneity of about 4700- 
fold purification. A summary of the purification steps for isolation of XT is shown in Table 1 . 
5 XT activity of the crude supernatant was approximately 0.006 mU/mg protein. The purification 
method according to the invention comprises four, preferably five different steps. It is possible, 
that one step can be achieved more than once if necessary. 

Step 1: Fractionated ammonium sulfate precipitation XT of the ammonium sulfate precipitable 
fraction was dissolved in 0.46 1 buffer A with solubilization of 79.5% of the original activity. 

10 Step 2: Heparin affinity chromatography on POROS 20 HE 4 ml of XT-enriched solution from 
step 1 was loaded onto the POROS 20 HE column. XT activity was completely retained on the 
column. More than 70% of total protein passed through the column. Contaminating protein was 
eluted at a low NaCl concentration. 44% of the XT activity bound to the heparin matrix 
emerged at 0.5 MNaCl (Fig. 1A). 

15 Step 3: Ion exchange chromatography on POROS 20 HQ 4 ml of the desalted XT-containing 
fraction from step 2 was loaded onto the POROS 20 HQ column equilibrated in buffer A. More 
than 98% of the XT activity bound to the resin. The column was then eluted stepwise with NaCl 
in buffer A (Fig. IB). XT-containing fractions were collected. 

Step 4: Affinity chromatography on protamine chloride The product of step 3 was desalted and 
20 concentrated using dia- and ultrafiltration. 100 |il of the protein solution was applied to the 
POROS protamine chloride column previously equilibrated with buffer A. Approximately 95% 
of the transferase activity bound to the column, whereas 75% of the contaminating protein did 
not. Additional proteins were eluted with buffer A containing low NaCl concentrations. 
Enzyme activity was eluted at approximately 0.15 M NaCl (Fig. 1C). The enzyme activity 
25 was stable for at least 6 months at -75°C. 

Step 5: SDS-Polyacrylamide gel electrophoresis (SDS-PAGE) XT-containing fractions from 
step 1-4 were subjected to SDS-PAGE on a 4-12% gradient polyacrylamide gel (Fig. 2, panel 
A). Coomassie-stained protein bands were excised and characterized by MALDI-TOF mass 
spectrometry after tryptic digestion. The molecular weight of an unknown protein was 
30 determined as 120.000 (Fig. 2, panel B). 
Step 5 is an optionally step. 
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Table 1: 

Summary of single purification steps employed for isolation of XT from 18.5 I of high density JAR 



cell culture supernatant. 



Step 


Volume 


Total activity Total protein 


Spec, activity Purification 


Recovery 




ml 


10-3 x units 


mg 


10-3 xunits/mg 


-fold 


% 


JAR high density cell culture 












Supernatant 


18,500 


535.8 


89,355.0 


0.006 


1 


100 


Ammonium sulfate 














precipitation 


460 


426.0 


8,937.8 


0.048 


8 


79 


Heparin affinity 














chromatography 


50 


108.3 


473.0 


0.229 


40 


35 


Ion exchange 














Chromatography 


5 


91.0 


43.1 


2.090 


348 


17 


Protamine affinity 














chromatography 


1 


6.83 


0.24 


28.458 


4,743 


1 



5 Amino acid sequence analysis of XT. The MW 120.000 protein from the excised band was 
digested with trypsin and endoproteinase Lys-C. The proteolytic fragments were separated by 
reversed-phase HPLC, and selected peptides were subjected to automatic amino acid 
sequence analysis; Table II shows the obtained 1 1 amino acid sequences determined by Edman 
degradation and mass spectrometry. 

10 Table H X represent a not identified residue. The masses of three peptides were observed in the 
MALDI mass spectrum of the enzymatically digested MW 120.000 protein. Calculated mass values 
were obtained from the sequence obtained. 



Peptide 


observed mass 
(M + H*) 


calculated mass 
(M + H*) 


(l)ELGAK 






(2) E L L K 






(3) D M N F L K 






(4)IASPPSDFGR 


1045.5 


1045.5 


(5) L L L D 






(6)DFENVDNSNF APR 


1524.7 


1525.7 


(7) P T F F A R 






(8)LQFSEVGT D X D A 






(9)ELGAVKPDGRL 


1152.6 


1153.7 


(10) ELLKRKLEQQEK 






(ll)LGLLMPEK 
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Immunochemical detection of XT 

Polyclonal antibodies against the synthetic peptide CSRQKELLKRKLEQQEK deduced from 
the peptides 2 and 10 of the enzymatically cleaved MW 120.000 protein were covalently bound 
on POROS 20 PA. About 50% of the XT activity of an applied sample was bound (Fig. 3, panel 

5 A) when a partially purified XT sample obtained by heparin affinity chromatography 

(purification step 2) was loaded onto the column. 58% of the adsorbed XT activity was eluted 
with 1 50 mM NaCl, and the rest was eluted with 12 mM HC1. Furthermore, the adsorbed XT 
activity was also eluted from the solid phase when 100 ^1.(1 mg/ml) of the synthetic peptide 
was added to the mobile phase (Fig. 3, panel C). When immobilized preimmune serum was 

10 used as affinity matrix (negative control) no XT activity was adsorbed to or eluted from 
the matrix (Fig. 3, panel B). The desalted XT fraction after heparin affinity chromatography 
(purification step 2) was loaded on an aprotinin affinity column. The elution profile shows four 
major protein peaks (Fig. 4, panel I). 61 % of the XT activity adsorbed to the aprotinin matrix 
emerged at 0.30 M NaCl and another 21% at 0.54 M NaCl. A single MW 120.000 band of the 

15 XT-containing fractions was detected by western blot analysis with the polyclonal antibodies 
(Fig. 4 } panel B). 

Determination of the molecular weight of XT 

100 \xl of heparin affinity purified XT was separated under non-reducing and non- 
20 denaturating conditions using a TSK G3000 SW column. Two XT activity maxima were 
detected at MW 500.000 and 120.000 (Fig. 5, panel A). The molecular mass of the MW 
120.000 protein was reduced about 3 % after N-glycosidase F digestion as shown by SDS- 
PAGE (Fig. 5, panel B). 

25 PCR-based cloning of human XT-I cDNA 

Based upon the amino acid sequence of 4 peptides degenerate primers were designed for 
cloning the XT-I cDNA (Figure 6). When primers SPPS1 and Lysc-inv were used in a PCR 
with the first strand cDNA of SW1353 chondrosarcoma poly(A) + mRNA as template, a major 
band of approximately 690 bp was observed. After subcloning and sequencing a previously 

30 unknown DNA sequence was obtained from clone pCGl 14-29. PCR amplification with the 
primers DF1 and a mixture of Inv2b and Inv2c resulted in a 1724 bp fragment, which was 
subcloned and sequenced. The deduced amino acid sequence of the clone pCGl 11-4 was 
identical to 6 of the sequenced peptides from human XT-I. The cloning strategy of rapid 
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amplification of cDNA ends (RACE) (Chenchik et al, 1996, BioTechniques 21, 526-534) was 
employed to clone the complete coding sequence of the XT-I cDNA. The largest DNA 
fragment obtained from the 3 5 -RACE reaction was 1.6 kb and consisted of a 3' -untranslated 
region of 1240 bp. The 5 a RACE reaction was performed with different primers derived from 

5 the nucleotide sequence of XT-I cDNA, but the 5 '-untranslated region of human XT-I cDNA 
could not be cloned using this method. All DNA fragments obtained contained just the known 
cDNA sequence and an additional 80 bp of coding sequence. Thus, a PCR-based screening 
approach using cDNA libraries as template was employed for cloning of the 5 5 -untranslated 
region. All DNA fragments obtained from the screening of 3 different human cDNA libraries 

10 stopped at the same nucleotide, indicating that stable secondary structures of the XT-I mRNA 
prevent the synthesis of cDNA of the 5' untranslated region during the reverse transcription 
reaction. However, the translation initiation codon of human XT-I could not be yet cloned. The 
combined cDNA of human XT-I contained 3726 bp (Fig. 7A) with a single open reading frame 
encoding at least 827 amino acids with a molecular mass of a least 91.000. The deduced amino 

15 acid sequence contained 3 potential N-glycosylation sites (Figure 7B). Analysis of the amino 
acid sequence using the hydrophobic cluster analysis (Gaboriaud et al, 1987, ) revealed the 
presence of a common DxD motif at position 182, which has been shown to be essential for 
binding nucleotide-sugars in glycosyitransferases . 

20 PCR-based cloning of human XT-H isoform cDNA 

The degenerate primers PFF-sense and Lysc-invl which were designed upon the amino acid 
sequence of proteolytic cleaved peptides of human XT-I were used in a PCR reaction with first 
strand cDNA of placenta poly(A) + mRNA as template. The PCR amplification resulted in a 
minor band of 1.1 kb, which was cloned and sequenced. The determined nucleotide sequence 

25 was similar but not identical to the XT-I cDNA sequence, indicating that the fragment encodes 
a XT-II isoform. To clone the complete coding sequence of the novel cDNA, the RACE 
strategy was employed. The cDNA finally obtained was 3608 bp (Figure 8 A) and contained a 
single open reading frame encoding a protein of 865 amino acids with a molecular mass of 
97.000. The 3 '-untranslated region is 850 bp and a 5 '-untranslated region of 149 bp was 

30 identified with an in-frame stop codon upstream of the ATG codon. A Kyte-Doolittie 

hydropathy analysis (Kyte & Doolittle, 1982, J. Mol Biol 157, 105-132) of the deduced amino 
acid sequence revealed one potential membrane-spanning region consisting of 16 hydrophobic 
amino acid residues at position 16 to 32, which appears to result in a type II transmembrane 
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orientation characteristic of many of the glycosyltransferases. The predicted amino acid 
sequence contains a proline-rich profile pattern from position 1 10 to 1 1 8 and 3 potential N- 
glycosylation sites (Figure 8B). The human XT-II isoform protein exhibits 55% overall 
sequence identity to the human XT-I including stretches in the XT-II protein with more than 
5 80% homology to the human 
XT-I. 

Cloning of rat XT-I and XT-II cDN A 

4 

For amplification of the XT-I and XT-II cDNA from rat tissue gene-specific primers were 
10 designed based upon the nucleotide sequence of human XT-I and XT-II cDNA. Theses 

oligonucleotides were employed in PCR, 5' RACE and 3' RACE reactions with rat liver cDNA 
as template and a 2593 bp cDNA fragment (Fig, 9) coding for rat XT-I was obtained. 5' RACE 
reactions using a diversity of gene-specific primers were performed for identification of the 5' 
end of the XT-I cDNA. All cDNA fragments obtained stopped at exactly the same nucleotide 
15 position as observed in the human XT-I cDNA, indicating that stable secondary structures of 
the XT-I mRNA inhibit cDNA synthesis of the 5' region. The XT-II cDNA fragment finally 
obtained after PCR amplification with rat liver cDNA was 2782 bp (Fig. 10) and included the 
entire coding region of rat XT-II. The deduced amino acid sequences of XT-I and XT-II (SEQ 
IDs 6, 8) from rat each displayed 94% identity to the corresponding human xylosyltransferase, 
20 Alignment of all amino acid sequences revealed the presence of highly conserved amino acid 
clusters in the central and carboxyterminal regions of the proteins. 

Identification of the chromosomal localization of the human XT-I and XT-II genes 
The gene coding for human XT-I could be assigned to the short arm of chromosome 16 at the 
25 region 1 6pl 3 . 1 . The radiation hybrid mapping using the Genebridge 4 radiation hybrid 
screening panel located the position of the PCR fragment 5.98 cR distal to the STS marker 
CHLC.GATA42E1 1 and 4.40 cR proximal to marker D16S499. The XT-II gene was identified 
on chromosome 17 at the position 17q2L3-17q22 and was found to be located distal to the STS 
marker WI-1 1424 and proximal to the marker WI-143 15. 

30 

Expression of recombinant XT 

To prove the function of the cDNA product, a recombinant form of XT-I was generated by 
fusing the cloned XT-I to an aminoterminal peptide tag. The fused protein was expressed in 
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CHO-K1 cells and absorbed from the medium by immunoprecipitation with anti-^restf- 
antibodies and protein G agarose beads to eliminate endogenous XT activity. The enzyme- 
bound beads were used as an enzyme source and assayed for XT activity as shown in Table 3 
(see below). No detectable XT activity was recovered by the affinity purification from a control 

5 transfection sample. The substrate specificity of the recombinant XT-I was similar to that of the 
XT-I isolated from human body fluids and cell culture supernatants. The recovered enzyme 
activity of the recombinantly expressed XT-I could be completely inhibited by addition of 250 
U of heparin. As specific inhibition of human XT activity, by heparin has been demonstrated 
previously (Kleesiek et al, 1987, I.e.), these results clearly indicate that the expressed protein is 

10 the human XT. However, no enzymatic transfer of xylose to the acceptor peptides used in this 
study was observed when expressing XT-II fused to the aminoterminal peptide tag in CHO-K1 
cells. 

To identify the XT-I reaction products, the bikunin peptide was labeled with [ 14 C]-D-xylose 
15 using the XT-I-bound beads as an enzyme source. The products were subsequently subjected to 
the linkage-specific digestion of the bound [ I4 C]-D-xylose with a- and p-xylosidase and 
alkaline p -elimination. Incubation of the reaction products with P-xylosidase resulted in the 
release of 74% of the incorporated [ 14 C]xyIose, whereas only less than 4% of the peptide-bound 
xylose was digested after treatment with a-xylosidase. The alkaline cleavage of the O- 
20 glycosidic linkage between the xylose and the p-hydroxyamino acid serine in the presence of 
borohydride lead to the liberation of more than 97% of the enzymatically transferred 
[ I4 C]xylose. In all the experiments performed no significant amount of [ I4 C]-D-xylose was 
incorporated without the addition of the bikunin peptide as acceptor. These results clearly 
indicate that a xylose residue was transferred to the hydroxyamino acid serine of the bikunin 
25 peptide through a p-linkage. In conclusion the expressed protein was identified as UDP-D- 
xyloseiproteoglycan core protein P-D-xylosyltransferase (EC 2.4.2.26). 
Table 3: 

Xylosyltransferase activity of recombinant XT-I expressed in CHO-K1 cells. XT activity of the 
en2yme fractions using different acceptors and the inhibition of enzyme activity by addition of 
30 heparin is shown. The synthetic bikunin, L-APP and L-APLP2 homologous peptides have 

been previously proved to be good acceptors for XT mediated xylosylation (Brinkmann et aL, 
1997; Gotting et al y 1998). No XT activity was detected in samples from mock-transfected 
cells after affinity purification, n.d., not detected (detection limit, 20 (iU/1). 
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Acceptor XTActivitiy 





pCG227-XT 






+ heparin 


Recombinant biknnin 


2854 


n.d. 


Bikunin peptide QEEGSGGGGQK 


463 


n.d. 


L-APLP2 peptide SENEGSGMAEQK 


515 


n.d. 


L-APP peptide TENEGSGLTNIK 


492 


n.d. 


Peptide SGG 


n.d. 


n.d. 


Chondroitin sulfate A 


n.d. 


n.d. 


Chondroitin sulfate C 


n.d. 


n.d. 



Tissue-specific expression of XT-land the XT-II isoform 

The expression of XT-I and XT-II isoform gene was examined in various human tissues using a 
RT-PCR based method with normalized first-strand cDNA. Each PCR yielded a single product 

5 with predicted nucleotide lengths of 490 bp for XT-I and 717 bp for XT-II, although the amount 
of the amplified product varied (Figure 11). Amplification of XT-I and XT-II fragments 
resulted in a product visible by ethidium bromide staining after 36 cycles, whereas the DNA 
fragment corresponding to the abundant glyceraldehyde-3-phosphate dehydrogenase transcript 
was visible after 20 cycles. The greatest abundance of XT-I expression was detected in the 

10 placenta, kidney and pancreas and only a very weak expression was detected in skeletal muscle. 
The greatest abundance of XT-II isoform is expressed in the kidney and pancreas. 

Cellular distribution of XT activity in cultured CHO-K1 cells 

After an incubation period of 3 days cultured CHO-K1 cells were harvested and the XT activity 
1 5 was determined in the spent culture supernatant and the cell lysates. 92% of the total XT 
activity was found to be located in the cell culture medium (93.1 mU/10 6 cells, SD 9-58), 
whereas only 2% was detected in the cell lysates (2.03 mU/10 6 cells, SD 0.44). 6% of the total 
XT activity (5.82 mU/10 6 cells, SD 2.18) was released from the membrane-containing fractions 
after addition of the detergence Triton X-100 indicating that less than 1/1 0th of XT is bound to 
20 cellular membranes. 

Pharmaceutical and diamostic use 

The xylosyltransferase enzymes according to the invention can be used directly as therapeutic 
drug in pathological situations where a deficiency of said enzyme and its isoforms can be 
25 detected. As pointed out above XT enzyme may be overexpressed in some diseases and 
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disorders. In these cases inhibitors and / or antagonists of XT-I and or XT-II can be used. Thus, 
the invention relates also to the use for the manufacture of a medicament for the treatment of 
diseases which are caused by increased levels of said enzymes } wherein said medicament is an 
inhibitor or antagonist of xylosyltransferase. 

5 As mentioned above the protein according to this invention can be used as diagnostic means to 
evaluate pathological conditions. As used herein, the term "pharmaceutical^ acceptable carrier" 
means an inert, non toxic solid or liquid filler, diluent or encapsulating material, not reacting 
adversely with the active compound or with the patient. Suitable, preferably liquid carriers, are 
well known in the art. The formulations according to the invention may be administered as unit 

10 doses containing conventional non-toxic pharmaceutically acceptable carriers, diluents, 
adjuvants and vehicles which are typical for parenteral administration. 
The term "parenteral" includes herein subcutaneous, intravenous, intra-articular and 
intratracheal injection and infusion techniques. Also other administrations such as oral 
administration and topical application are suitable. Parenteral compositions and combinations 

1 5 are most preferably adminstered intravenously either in a bolus form or as a constant fusion 
according to known procedures. Tablets and capsules for oral administration contain 
conventional excipients such as binding agents, fillers, diluents, tableting agents, lubricants, 
disintegrants, and wetting agents. The tablets may be coated according to methods well known 
in the art. 

20 Unit doses according to the invention may contain daily required amounts of the protein 

according to the invention, or sub-multiples thereof to make up the desired dose. The optimum 
therapeutically acceptable dosage and dose rate for a given patient (mammals, including 
humans) depends on a variety of factors, such as the activity of the specific active material 
employed, the age, body weight, general health, sex, diet, time and route of administration, rate 

25 of clearance, enzyme activity (units/mg protein), the object of the treatment, i. e., therapy or 
prophylaxis and the nature of the disease to be treated. 

Therefore, in compositions and combinations in a treated patient (in vivo) a pharmaceutical 
effective daily dose of the protein of this invention (hXT, hXT-I, hXT-II) is between about 0.01 
and 1 00 mg/kg body weight (based on a specific activity of 1 00 kU/mg), preferably between 
30 0.1 and 10 mg/kg body weight. According to the application form one single dose may contain 
between 0.5 and 10 mg of hXT. 
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Short Description of the Figures 
Figure 1; Purification of XT 

Panel A. Heparin affinity chromatography on POROS20 HE: The dissolved ammonium sulfate 

5 precipitate from JAR choriocarcinoma supernatant was applied to a POROS 20 HE column. 

After equilibration with buffer A, the column was eluted stepwise with NaCl ( ). Protein 

elution was monitored at wavelength A 2 8o ( ), and fractions of 38 ml were assayed for XT 

activity (■ ). The horizontal bracket indicates fractions collected for further purification. 
Panel B. Ion exchange chromatography owPOROS 20 HQ : The desalted and concentrated 

10 product from heparin affinity chromatography was loaded onto a POROS 20 HQ column. The 
column was washed with buffer A. Adsorbed proteins were eluted stepwise with NaCl ( ~ ). 

Protein elution was monitored at A280 ( ), and fractions of 50 ml were assayed for XT 

activity (■). The horizontal brackets indicate the fractions collected for further purification. 
Panel C Protamine chloride affinity chromatography: The desalted and concentrated product 

15 from ion exchange chromatography was applied to a protamine chloride POROS column. After 
a washing step with buffer A, the column was eluted stepwise with NaCl 

( ). Elution was monitored at A280 ( ), and fractions of 6 ml were assayed for XT 

activity ( ■ ). The horizontal bracket indicates the fractions collected. 
Figure 2 SDS-PAGE of XT fractions at various purification steps 

20 Panel A: XT fractions at various purification steps were subjected to 4-12% 

gradient polyacrylamide gel for SDS-PAGE. Lane I JAR cell culture supernatant 

(crude material); lane II ultrafiltration retentate; lane III, dissolved ammonium 

sulfate precipitate; lane IV, protein eluted with NaCl from the POROS 20 HE column; lane 

V, protein eluted with NaCl from the POROS 20 HQ column; lane VI protein eluted with NaCl 

25 from the protamine chloride POROS column. Lane M t molecular size standard: myosin 

(200.000), phosphorylase b (97.000), bovine serum albumin (66.000), glutamic dehydrogenase 
(55.000), carbonic anhydrase (31.000), trypsin inhibitor (22.000). The gel was silver-stained. 
Panel B: Collected XT fractions from protamine affinity chromatography were subjected to 
SDS-PAGE on a 4-12% gradient polyacrylamide gel. The arrows indicate the stained bands 

30 corresponding to the unknown protein, which was shown to be XT (120.000) (1), hexose-6- 
phosphate dehydrogenase (89.000) (2), ezrin (68.600) (3), quiescin Q6 (64.000) (4\ 
plasminogen activator (47.000) (5), aldolase A (39.000) (6) and low molecular artefacts (7). All 
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protein bands were excised, enzymaticaUy digested, and the peptide mixture was characterized 
by MALDI-TOF mass spectrometry. The gel was stained with Coomassie Brilliant Blue. 
Figure 3: Immunoaffinity chromatography of xylosyltransferase 

Panel A. 100 \A of the desalted XT-containing fractions eluted from heparin affinity matrix was 
5 applied to a column of immobilized polyclonal antibodies. After washing with buffer D, the 
column was eluted as indicated by the arrow with buffer D / 0.15 M NaCl (1) followed by 12 
mM HC1 (2). Fractions of 1 ml were collected into tubes containing 1 ml of 0.1 M Tris/HCl, pH 

8.0. Protein elution was monitored at 280 nm ( ) and.the XT activities (■) of each fraction 

were assayed. 

10 Panel B. Negative control of the immunoaffinity chromatography with immobilized preimmune 
serum. For conditions see Panel A. 

Panel C. Immunoaffinity chromatography with immobilized polyclonal antibodies. The column 
was eluted with buffer D containing 100 \xl (1 mg/ml) of the peptide antigen indicated by the 
arrow. Fractions of 1 ml were collected into tubes containing 1 ml of 0.1 M Tris/HCl, pH 8.0. 

15 Protein elution was monitored at 280 nm ( ) and the XT activities (■) of each fraction 

were assayed. 

Figure 4: Aprotinin affinity chromatography of partially purified XT and 
immunoblot analysis of the separated fractions 

Panel A t 200 (il desalted XT fraction from the heparin purification step was applied to an 
20 aprotinin column previously equilibrated with buffer A. After washing with buffer A, the 

adsorbed proteins were eluted stepwise with NaCl ( — ). Protein elution was monitored at 

A280 ( ) and fractions of 2 ml were assayed for XT activity (■). 

Panel B t Aliquots of fraction 3, fraction 7, fraction 12 and fraction 16 were analyzed by western 
blot. The MW 120.000 protein was detected in the XT-containing fraction 7 and 12. 

25 Immunological detections were performed using a polyclonal rabbit antiserum raised against 
the synthetic peptide CSRQKELLKRKLEQQEK deduced from the peptides 2 and 10 of the 
enzymatically cleaved unknown protein. Prestained molecular size standard were myosin 
(190.000), BSA (64.000), glutamic dehydrogenase (51.000). 
Figure 5 : Gel filtration chromatography and N-glycosidase F digestion 

30 Panel A t 100 \il of the XT containing fraction from heparin affinity HPLC was applied to a 
TSK G3000 SW column (30 cm X 7.5 mm, 10 nm particle size) previously equilibrated with 
buffer A / 0.1 5 M NaCl. Elution was performed with the same buffer. Fractions of 200 \il were 
collected and assayed for XT activity (■). Protein elution was monitored at A280 ( )- The 
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arrows indicate the elutiou positions of thyroglobulin (669.000) (I), ferritin (440.000) (2), 
aldolase (158.000) <3), albumin (67.000) (4\ ovalbumin (43.000) (5\ chymotrypsinogen A 
(25.000) (6), andribonuclease A (13.700) (7). 

Panel B t An aliquot of 1 \ig of the MW 120.000 protein was analyzed by SDS-PAGE 

5 (I). Aliquots (1 ^g) of the MW 120.000 protein were digested with 3,1x10 " 3 units of N- 
glycosidase F at 37°C for 1 h (II) and 12 h (HI). The samples were then subjected to SDS- 
PAGE, and protein bands were detected by silver staining. The molecular size standard were 
myosin (200.000), b-galactosidase (1 16.000), phosphorylase b (97.000). 
Figure 6: Cloning strategy for human XT-I and XT-II isoform. 

10 (A) The peptides 4, 6, 7, and 9 are sequences that were most favorable for the design of 
degenerate PCR primers. The amino acid sequence of the peptides was obtained after 
proteolytic digestion of the purified human XT. The strategy for cloning of XT-I cDNA (B) and 
XT-II isoform (C) is illustrated. The open reading frame of XT-I and XT-II is shown as a filled 
box, and the location of the peptides 4, 6, 7 and 9 is illustrated by open boxes. The location and 

15 orientation of the degenerate primers employed for cloning of XT-I and XT-It are marked by 
arrows. XT-I and XT-II cDNA inserts contained within the indicated plasmids were obtained 
using RT-PCR with degenerate primers (pCGl 1 1-4, pCGl 14-29, pCGl 1 0-7), 5' RACE 
(pCG185-21, pCG212-19, pCG319-23, V RACE (pCG204-38, pCG21 1-4) and RT-PCR with 
gene-specific primers (pCG176-l). H = A + C + T, Y = C + T, R = A + G, N = A +G + C + T, I 

20 = deoxyinosine. 

Figure 7 A, B: Nucleotide (A) and deduced amino acid (B) sequence of the hXT-L 
The position of the peptide sequences obtained by digestion of the purified human XT are 
underlined (B). Potential N-glycosylation sites are double underlined. The DxD motif is 
intensified depicted. The figures are identical with SEQ ID 1 and 2. 
25 Figure 8A.B: Nucleotide (A) and amino acid (B) sequence of the hXT-II isoform. 

Potential N-glycosylation sites are double underlined (B). The putative transmembrane domain 
is underlined. The figures are identical with SEQ ID 3 and 4. 

Figure 9: Nucleotide sequence of rXT-I (SEQ ID 5). The corresponding protein sequence is 
depicted in SEQ ID 6. 

30 Figure 10: Nucleotide sequence of rXT-II (SEQ ID 7). The corresponding protein sequence is 
depicted in SEQ ID 8. 

Figure 11 : Differential expression of the XT-l XT-II gene in human tissues. Semiquantitative 
RT-PCR with normalized first-strand cDNA was used to examine the abundance of XT-I and 
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XT-II transcripts. A 983 bp fragment of the housekeeping gene glyceraldehyde-3 -phosphate 
dehydrogenase cDNA was amplified as control (A). The 490 bp XT-I cDNA fragment (B) and 
the 717 bp fragment amplified from XT-II isoform cDNA (C) were detected in each tissue 
indicating that both enzymes are ubiquitously expressed. The arrows indicate the expected 
5 position for each PCR product 

The following examples describe the invention in more detail without to limit scope of the 
technical teaching. 

Example 1: Materials for Isolation and Purifications. Human JAR choriocarcinoma cells 

10 were purchased from ATCC (Rockville, MD). Dried UltraDOMA-PF medium was obtained 
from BioWhittaker (Vervier, Belgium) and aqua ad injecta from Braun (Melsungen, Germany). 
Heat-inactivated fetal calf serum, Dulbecco's phosphate-buffered saline, antibiotic/antimycotic 
solution, trypsin-EDTA solution, trypan blue, protamine chloride and the Bicinchoninic Acid 
Protein Assay Kit were purchased from Sigma (Deisenhofen, Germany). Cell culture 

15 flasks, serological pipettes, and sterile tubes were purchased from Becton 

Dickinson (Heidelberg, Germany). The hybrid hollow-fiber bioreactor TECNOMOUSE® 
was supplied by Integra Biosciences (Fernwald, Germany), the ACA analyzer by 
Dade Diagnostica (Munchen, Germany) and the Super G analyzer by RLT 
(Mdhnesee, Germany). UDP-[ 14 C]xylose (9.88 kBq / nmol) came from DuPont (Bad 

20 Homburg, Germany), 25 mm diameter nitrocellulose discs from Sartorius (Gottingen, 
Germany), scintillation mixture and the liquid scintillation counter LS500TD was obtained 
from Beckman Coulter (Fullerton, CA). Ultrafiltration cells, YM1 membranes and PVDF 
membranes (Immobilon P) were purchased from Millipore (Eschborn, Germany). 
The chromatography media POROS 20 HQ, POROS 20 HE2, POROS 20 AL, POROS 20 EP 

25 and the HPLC workstation Biocad Sprint were supplied by Perseptive 

Biosystems (Framingham, MA). The gel filtration column TSK G3000 SW (30 cm x 7.5 mm, 
10 \xm particle size) was obtained from TosoHaas (Montgomeryville, PA). The MALDI mass 
spectrometer Reflex II was from Bruker Daltonik GmbH (Bremen, Germany) and protein 
sequencer Procise 494 cLC was purchased from PE Biosystems (Framingham, MA). Precast 

30 polyacrylamide gels, buffers, and NuPAGE electrophoresis system XCell II Mini-Cell and Blot 
Module were from Novex (San Diego, CA). The synthetic peptide CSRQKE.LLKRKLEQQEK 
and the rabbit antiserum were purchased from Bioscience (GOttingen, Germany). Peroxidase- 
conjugated affinipure F(ab')2 fragment goat anti-rabbit IgG (H+L) was purchased from 
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Dianova (Hamburg, Germany). N-glycosidase F was obtained from Roche (Mannheim, 
Germany). 

Example 2: Cell culture. JAR choriocarcinoma cells releasing XT in the cell culture 
supernatant were cultured in Ultradoma-PF medium containing 10% heat-inactivated fetal calf 

5 serum, 100 U/ml penicillin, 100 |ig/ml streptomycin and 250 ng/ml amphotericin B in a 

humidified atmosphere of 5% C02 and 95% air at 37°C. After incubation for 24 h in the serum- 
containing medium, the cell cultivation was adapted to serum-free conditions as described 
previously (6). Scaling up of XT production was carried out in three hybrid hollow-fiber 
bioreactors TECNOMOUSE®. During the exponential growth the cells from three 175 cm 2 T- 

10 flasks (> 3 x 10 7 cells) were detached with 0.5% trypsin and 0.2% EDTA in Duibecco's 

phosphate-buffered saline by incubation at 37°C for 10 min. After centrifugation (5 min, 1000 x 
g) of the cell suspension the cell pellet was resuspended in 10 ml 37°C warm serum-free and 
protein-free Ultradoma-PF medium and washed three times with 20 ml of the same medium. 
The cell suspension was drawn into a 10 ml syringe and then inoculated into the extracapillary 

15 space (EC space) of the reactor. The bioreactor was connected with a 2 liter medium bottle and 
set to 150 ml/h in the recirculation mode, and the oxygenation pump was set as described in the 
operating manual. Five days after inoculation a 10 ml syringe was connected to the left hand 
EC port and 10 ml of cell culture supernatant was harvested from the EC space. The harvesting 
was continued every two days over a period of 3 months. Glucose and lactate concentration of 

20 the cell culture supernatant were controlled using the Super G analyzer and the ACA analyzer, 
respectively. The 2 liter medium bottle was replaced every three days. The viability of the cells 
was determined by trypan blue exclusion. 

Example 3: Synthesis of the protamine affinity matrix. Protamine chloride was immobilized as 
ligand on POROS 20 AL. 30 mg ligand was dissolved in 10 ml of 10 mM phosphate / 0.15 M 

25 NaCl, pH 7.4. After the protein had been dissolved, 5 ml of 1 00 mM phosphate / 1 .50 M NaCl, 
pH 7.4 was added. NaCNBH3 was dissolved in the ligand/buffer solution to a final 
concentration of 5 mg/ml and 1.0 g POROS 20 AL was suspended in the same solution. 
The suspension was mixed gently on a shaker for 1 minute at RT. An additional 2 ml of 100 
mM phosphate / 1 .50 M NaCl, pH 7.4 was added to the suspension and the mixture was shaken 

30 continuously. This step was repeated every five minutes until the mixture volume was 25 ml. 
After additional shaking for 2 h, the medium was filtered on a sintered glass funnel. The matrix 
was suspended in 20 ml of 0.2 M Tris/HCl / 5 g/l NaCNBH 3 , pH 7.2 and mixed gently on a 
shaker for 30 min at RT. After the media had been washed in a sintered glass funnel using 1 00 
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ml of 10 mM phosphate, pH 7.4, 100 ml of 1.0 M NaCl and another 100 ml of 10 mM 
phosphate, pH 7.4, the matrix was packed in a PEEK column (4.6 x 50 mm). 
Example 4: Synthesis of the aprotinin affinity matrix. Aprotinin affinity matrix was synthesized 
according to the synthesis of the protamine affinity matrix, but using aprotinin as ligand. After 

5 immobilization of the ligand the matrix was packed in a PEEK column (4.6 x 50 mm). 
Example 5: Purification of xylosyltransferase from JAR cell culture supernatant. 
Fractionated ammonium sulfate precipitation and. chromatography steps were performed at RT, 
ultrafiltration and diafiltration were carried out at 4°C. 1 8-5 liters JAR cell culture supernatant 
collected from three hybrid hollow-fiber bibreactors, TECNOMOUSE®, each containing 5 

10 culture cassettes, was concentrated to 800 ml with ultrafiltration cells using YM1 cellulose 
membranes. The retentate was centrifuged at 4,000 x g for 1 h. The supernatant was decanted, 
and the pellet was discarded. 

Step 1: Fractionated ammonium sulfate precipitation - Solid ammonium sulfate was added to 
the supernatant to 28% saturation. After 1 h at RT the suspension was centrifuged at 4,000 x g 

15 for 2 h 5 the supernatant was decanted, and the precipitate was removed. Additional ammonium 
sulfate was added to the solution to the point of 40% saturation, and the suspension was 
allowed to stand for 1 h. To recover the precipitate the supernatant was decanted after the 
suspension was centrifuged at 4,000 x g for 2 h. Before chromatography on immobilized 
heparin the precipitate was dissolved in 460 ml buffer A (20 mM sodium acetate, pH 6.0). 

20 Step 2: Heparin affinity chromatography on POROS 20 HE2 - The step 1 product was passed 
through a 0.2 \im filter. 4.0 ml of the filtrate was applied to a POROS 20 HE2 column (16 x 100 
mm) equilibrated with buffer A at a flow rate of 40 ml/min. After washing the column with 100 
ml of buffer A the XT activity was eluted with the same buffer containing NaCl. The NaCl 
concentration was increased stepwise: 20 ml buffer A / 0.09 M NaCl; 20 ml buffer A / 0. 1 5 M 

25 NaCl; 30 ml buffer A / 0.24 M NaCl; 24 ml buffer A / 0.30 M NaCl; 24 ml buffer A / 0.60 M 
NaCl; 24 ml buffer A/ 1.00 M NaCl; and 24 ml buffer A/ 1.89 M NaCl. Fractions of 38 ml 
each were collected and the XT activity was measured. The procedure was repeated 115 times 
by cyclic chromatography and the fractions containing XT activity (115x38 ml) 
were collected. 

30 Step 3: Ion exchange chromatography on POROS 20 HQ - Collected fractions from step 2 were 
desalted using diafiltration with YM1 cellulose membranes and ultrafiltration cells. After 
concentration of the desalted protein solution to 0.05 liter using analogous techniques the XT- 
enriched solution was subjected to ion exchange chromatography. 4.0 ml of the XT solution 



WO 01/49831 



- 24 - 



PCT/EPOO/13311 



was applied onto the POROS 20 HQ column (16 x 100 mm) previously equilibrated with buffer 
A at a flow rate of 40 ml/min. The column was washed with 80 ml buffer A, and the adsorbed 
protein was eluted stepwise using the same buffer containing 0.07 M NaCl (88 ml), 0.18 M 
NaCl (120 ml), and 0.36 M NaCl (120 ml) followed by a linear gradient of 0.36 - 1.00 M NaCl 
5 (200 ml) and another step of buffer A / 2.0 M NaCl (120 ml). 50 ml fractions were collected 
and assayed for activity and evaluated by SDS-PAGE. Chromatography was repeated 13 times, 
and the fractions exhibiting XT activity (13 x 50 ml) were collected for affinity 
chromatography. 

Step 4: Affinity chromatography on protamine chloride - XT-containing solution from step 3 

10 was desalted as described above and concentrated to 5 ml by ultrafiltration with YM1 cellulose 
membranes. The ultrafiltration product was passed through a 0.2 \xm filter. 100 |il of the filtrate 
was loaded onto a protamine chloride - POROS column (4.6 x 50 mm) equilibrated with buffer 
A. The flow rate was 10 ml/min. The column was washed with 10.0 ml of buffer A, and the 
adsorbed fraction was eluted with the same buffer containing NaCl by a stepwise increase of 

15 the NaCl concentration: 6.6 ml buffer A / 0.04 M NaCl; 6.6 ml buffer A / 0,06 M NaCl; 6.6 
ml buffer A / 0.23 M NaCl followed by a linear gradient of 0.23 - 1 .20 M NaCl (4.2 ml) 
in buffer A. Fractions of 6.0 ml were collected, assayed for XT activity and evaluated by SDS- 
PAGE. Cyclic chromatography was repeated 50 times. The purified enzyme was collected, 
concentrated to L0 ml using ultrafiltration techniques and stored at - 75 °C. 

20 Step 5: SDS-Potyacrylamide gel electrophoresis (SDS-PAGE) - The protein composition of 
* various fractions was estimated by SDS-PAGE. Briefly, 12.1 \xl of sample was added to 4.7 \il 
of sample buffer (1.00 M Tris/HCl / 1.17 M sucrose, 0.28 M SDS, 2.08 mM EDTA, 0.88 mM 
Serva Blue G250, 0.70 mM phenol red, 0.10 M DTT, pH 8.5) and heated for 10 minutes at 
99°C. After the sample had been loaded, SDS-polyacrylamide gel electrophoresis was carried 

25 out on a 4-12% bis-tris polyacrylamide gel with 3-(N-morphoIino) propanesulfonic acid 

(MOPS) running buffer (1.00 M MOPS/Tris / 69.3 mM SDS, 20.5 mM EDTA, pH 7.7). Protein 
bands were detected by Coomassie Brilliant Blue or by silver staining. The Coomassie bands 
were excised and characterized by MALDI mass spectrometry and amino acid sequence 
analysis. 

30 Example 6: MALDI mass spectrometry. Coomassie-stained proteins were excised from the gel, 
repeatedly washed with H2O and H20/acetonitrile and digested overnight with trypsin and 
endoproteinase Lys-C at 37°C. The peptides generated in the supernatant were analyzed by 
MALDI mass spectrometry. Sample preparation was achieved following the thin film 
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preparation techniques (13). Briefly, aliquots of 03 \x\ of a nitrocellulose containing saturated 
solution of a-cyano-4- hydroxycinnamic acid in acetone were deposited onto individual spots 
on the target. Subsequently, 0.8 jil 10% formic acid and 0.4 ^1 of the digest sample was loaded 
on top of the thin film spots and allowed to dry slowly at ambient temperature. To remove salts 

5 from the digestion buffer the spots were washed with 10% formic acid and with H 2 0. MALDI 
mass spectra were recorded in the positive ion mode with delayed extraction on a Reflex II 
time-of-flight instrument equipped with a SCOUT multiprobe inlet and a 337 nm nitrogen laser. 
Ion acceleration voltage was set to 20.0 kV, the reflector voltage was set to 21 .5 kV and the 
first extraction plate was set to 15.4 kV. Mass spectra were obtained by averaging 50-200 

10 individual laser shots. Calibration of the spectra was performed internally by a two-point linear 
fit using the autolysis products of trypsin at m/z 842.50 and m/z 221 LI 0. 
Example 7: Amino acid sequence analysis of XT. The MW 120.000 Coomassie-stained protein 
was excised from the gel, repeatedly washed with H2O and H20/acetonitrile and digested with 
trypsin and endoproteinase Lys-C overnight. For HPLC separation the excised gel fragment 

15 was extracted twice with 0.1 % TFA/60% acetonitrile. The extracted enzymatic fragments were 
separated on a capillary HPLC system equipped with a HOB solvents delivery system 
(PE Biosystems), Acurate splitter (LC-Packings), UV absorbance detector 75 9 A 
(PE Biosystems), U-Z capillary flow cell (LC-Packings) and Probot fraction collector 
(BAI) using reversed-phase column (Hypersil CI 8 BDS, 3 ^m, 0.3 x 150 mm) and a linear 

20 gradient from 12% acetonitrile, 0. 1% TFA to 64% acetonitrile, 0.08% TFA in 90 min with a 
flow rate of 4 [il/min at RT. Peptide elution was monitored at 214 nm and individual fractions 
from the HPLC separation were analysed by MALDI mass spectrometry. Sequence analysis 
of separated fragments was performed on a Procise Protein Sequencer 494 cLC using standard 
programs supplied by PE Biosystems. 

25 Example 8: Determination of XT activity. Determination of XT activity is based on the 
incorporation of [ 14 C]-D-xylose into the recombinant bikunin according to a previously 
described method (Brinkmann et al., 1997, J. Biol. Chem., 272, 11171-11175). For analysis of 
the substrate specificity of the recombinant xylosyltransferases synthetic peptides containing 
the XT recognition sequence were used as acceptor. The reaction mixture for the assay 

30 contained, in a total volume of 100 |xl: 50 fil of XT solution, 25 mM 
4-morpholineethanesulfonic acid (pH 6.5), 25 mM KCl, 5 mM KF, 5 mM MgCl 2 , 5 mM 
MnCl 2 , 1.0 \M UDP-[ 14 C]-D-xylose, and 1.5 ^M of the synthetic peptides. After incubation for 
75 min at 37°C, the reaction mixtures were placed on discs (25 mm diameter) of Immobilori- 
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AV membrane, which immobilizes even small peptides by covalent links (Pfund & Bourdage, 
1990, Mol Immunol. 27, 495-502), and allowed to dry. It was then washed for 10 min with 
10% trichloroacetic acid and three times with 5% trichloroacetic acid solution. Incorporated 
radioactivity was determined by liquid scintillation counting. 

5 Example 9: Antiserum Preparation. The synthetic peptide CSRQKELLKRKLEQQEK 
deduced from the sequenced peptides 2 and 10 of the enzymatically cleaved XT was 
synthesized, purified by HPLC and used for immunization of rabbits (Bioscience, Germany). 
Polyclonal antiserum was obtained by injection of the above antigen followed Example 10: 
Preparation of solid-phase antigen, the antigen CSRQKELLKRKLEQQEK was 

10 immobilized on POROS 20 EP. After 1 .6 mg antigen was dissolved in 1 .2 ml of 1 0 mM 
phosphate / 0.15 M NaCl, pH 7.4 0.60 ml of 100 mM phosphate / 1.50 MNaCl, pH 7.4 was 
added. 400 mg POROS 20 EP was suspended in the solution and the suspension was mixed 
gently on a shaker at RT. At 10-min intervals, five times in total, an additional 0.24 ml of 100 
mM phosphate / 1 .50 M NaCl, pH 7.4 was added to the suspension. After additional shaking for 

15 5 days at RT the suspension was filtered on a sintered glass funnel The matrix was suspended 
in 4 ml 0.2 M phosphate / 0.1 M 2-mercaptoethanol, pH 7.4 and mixed on a shaker for 2 h at 
RT. The matrix was washed in a sintered glass funnel using 20 ml of 10 mM phosphate / 0.15 
M NaCl, pH 7.4 and 20 ml of 1 .0 M NaCl. After additional washing with 20 ml of 1 0 mM 
phosphate, pH 7.4, the matrix was packed in a PEEK column (4.6 x 50 mm). 

20 Example 11: Antibody purification. Antiserum was adjusted to 50 mM Tris/HCl, pH 8.0. 
The solution was clarified by passage through a 0.2 \xm filter. 0.4 ml filtrate was applied at 10 
ml/min to the antigen column previously equilibrated with buffer B (50 mM Tris/HCl, pH 8.0). 
After the column was washed with 4. 1 ml buffer B and with 12.5 ml buffer B / 0. 1 5 M 
NaCl, the adsorbed antibody was eluted using 12.4 ml of 50 mM sodium citrate / 0.15 M NaCl, 

25 pH 3.0 followed by 3.4 ml of 100 mM sodium citrate / 1.5 M NaCl, pH 3.0. The eluate was 
collected as 10 ml fractions in tubes containing 2 ml of 0.5 M Tris/HCl, pH 8.0, to immediately 
neutralize the citric acid. 

Example 12: Preparation of immunoaffinity column. Purified antibody was concentrated 
to a protein concentration of 0.3 mg/ml using ultrafiltration with YM1 cellulose membranes. 
30 The antibody solution was adjusted to 10 mM phosphate / 0.15 M NaCl, pH 7.4. After filtration 
of the solution through a 0.2 ^m filter 100 \xl filtrate was applied at 0.2 ml/min to a POROS 20 
PA column (2.1 x 30 mm). This step was repeated 17 times. Adsorbed antibody was cross- 
linked using cross-linking solution (100 mM triethanolamine, pH 8.5). After the column was 
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washed with 5 ml of buffer C (10 mM phosphate / 0.15 MNaCl, pH 7.4) 2 ml cross-linking 
solution was applied at 0.5 ml/min onto the cartridge. The procedure was repeated 6 more 
times, using a total volume of 14 ml cross-linking solution. To block unreacted functional 
groups on the cross-linking reagents 2 ml of 100 mM monoethanolamine, pH 9.0 

5 (quenching solution), was loaded onto the cartridge at 0.5 ml/min. The column was 
washed using 2 ml of buffer C and the cross-linking step was repeated using another 2 
ml quenching solution. The immunoaffinity column was cycled between buffer C and 12 mM 
HC1 / 0, 15 M NaCl 3 times using a total volume of 12 ml .of solution. 
Example 13; Immunoaffinity column purification of XT. XT-containing fractions eluted 

10 from the heparin affinity matrix were desalted using diafiltration with YM1 cellulose 

membranes and passed through a 0.2 [im filter. 100 jal of this XT-sample was applied to the 
immunoaffinity column equilibrated with buffer D (20 mM Tris/HCl, pH 8.0) at a flow rate of 1 
ml/min. The column was washed with 1 .4 ml of buffer D and with 8.5 ml of buffer D / 0. 1 5 M 
NaCl. The XT activity was eluted with 4.2 ml of 12 mM HC1 followed by 1 .2 ml of 12 mM 

15 HC1 / 1 .5 M NaCl. Alternatively the elution was performed using 1 00 \x\ of antigen at 1 mg/ml 
in buffer D. Fractions (1 ml) were collected into tubes containing 1 ml of 0.1 M Tris/HCl, 
pH 8.0. The XT activities of the fractions were determined. 

Example 14; Aprotinin affinity chromatography. 200 \i\ of desalted XT solution from the 
heparin purification step was applied at 10 ml/min to the aprotinin column previously 

20 equilibrated with buffer A. After washing the column with 6.6 ml of buffer A the adsorbed ' 
protein was eluted stepwise using the same buffer containing 0.3 M NaCl (10.0 ml), 0.54 M 
NaCl (1 0.0 ml), 1 .00 M NaCl (10.0 ml) and 1 .50 M NaCl (2.4 ml). Example 15: Western 
Blot Analysis. For western blot analysis, proteins were transferred to polyvinylidene 
difluoride membrane in a semi-dry instrument (Novex). After transfer nonspecific 

25 antibody binding sites were blocked with 2% BS A in 0. 1 M Tris/HCl, pH 7.2, for 1 h at RT. 
The membrane was incubated with antiserum in 50 mM phosphate / 0.15 MNaCl, 0.5 
ml/1 Tween 20, pH 7.4 at 1:1000 dilution for 1 h. Bound antibody was detected using a second 
anti-rabbit goat immunoglobulin coupled to horse-radish peroxidase at a 1:1000 dilution. The 
blot was developed using 4-chloro-l-naphthol. 

30 Example 16: Gel filtration chromatography. A sample of 100 nl from the heparin 

purification step was applied at 1.0 ml/min to a TSK G3000 SW column (30 cm x 7.5 mm, 10 
^m particle size) which had previously been equilibrated with buffer A / 0.1 5 M NaCl. Proteins 
were eluted with the same buffer. Fractions of 200 |al were collected and tested for XT activity. 
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Column calibration was performed using thyroglobulin (669.000), ferritin (440.000), 
aldolase (158.000), albumin (67.000), ovalbumin (43.000). 

Example 17: N-glycosidase F digestion. Aliquots (1 ^g) of XT were digested with 3,1 x 
10 " 3 units of N-glycosidase F at 37°C for 1 h and 12 h (Table IT). The samples were then 
5 subjected to SDS-PAGE, and protein bands were detected by silver staining. 

Example 18: Measurement of protein concentration. Protein concentration was estimated 
by absorbance at 280 nm assuming E l% i cm = 10.0 or with the Bicinchoninic Acid Protein 
Assay using bovine serum albumin as a standard. 

Example 19: PCR-based cloning of human xylosyltransferase. Degenerate oligonucleotide 

10 primers with deoxyinosine substitution were designed based upon the amino acid sequence of 
peptides obtained after digestion of the isolated human XT with trypsin or Lys-C. The first 
strand of cDNA was synthesized by the reverse transcription reaction using poly(A) + RN A 
isolated from the chondrosarcoma cell line SW1353 as template and oligo(dT) as primer. The 
reverse transcription reaction was performed at 37°C for 2 h using 50 pmol of oligo(dT) 

15 primers, 1 jig of poly(A) + RNA, a 0,5 mM concentration of each dNTP, lx RT buffer and 200 
units of RNase H deficient Moloney murine leukemia virus reverse transcriptase (Life 
Technologies, Eggenstein, Germany) in a final volume of 20 |il. For PCR amplification the 
reaction mixture contained 4 \i\ of the reverse transcription reaction solution, 50 pmol of each 
primer, a 0.25 mM concentration of each dNTP, 50 mM KC1, 10 mM Tris-HCl (pH 8.3), L5 

20 mM MgCk and 2.5 units of hot start Taq polymerase (Life Technologies) in a final volume of 
50 jj! Amplification with degenerate oligonucleotide primers was carried out by 40 cycles at 
94°C for 1 min, 48°C for 1 min, and 72°C for 2 min, followed by a final extension step at 72°C 
for 15 min. After agarose electrophoresis of the PCR products, DNA fragments were excised, 
subcloned into the pCR2.1 vector (Invitrogen. Groningen, Netherlands) and sequenced by the 

25 dideoxy chain termination method using Taq DNA polymerase (Big-dye terminator cycle 

sequencing kit, Perkin-Elmer, Norwalk, CT, USA) with an automated DNA sequencer (Applied 
Biosystems, Weiterstadt, Germany). Several clones were sequenced to compensate for 
misreading by Taq polymerase. 

Example 20: Rapid amplification of 5 ' and 3 ' cDNA ends (RACE). For amplification of the 5' 
30 and 3 5 ends of the XT-I and XT-II isoform cDNA RACE experiments were performed using 
commercially available systems (Clontech, Heidelberg, Germany; Life Technologies) according 
to the manufacturers' instructions. For 3'-RACE 1 \ig placenta poly(A) + mRNA (Clontech) was 
reverse transcribed with a 3'-CDS primer (Clontech). PCR amplification of the 3' cDNA end of 
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human XT-I was accomplished according to a touch-down PCR protocol (5 cycles: 94°C for 30 
s, 72°C for 3 min, 5 cycles: 94°C for 30 s, 70°C for 45 s, 72°C for 3 min, 25 cycles: 94°C for 30 
s, 65°C for 45 s, 72°C for 3 min) with the gene-specific primer GSPlb 5'- 
GTGGGTATGCAGAAGTGGGGGAAGGGAC-3' and the UPM primer mix (Clontech). An 

5 aliquot of the first PCR was subjected to semi-nested PCR (30 cycles) using the primer 
Con21 1 1 5'-CCCTCCGCAATGCCTACA-3' and the UPM primer mix. The DNA fragments 
obtained were subcloned into the vector pCR2.1 (Invitrogen) and sequenced. PCR amplification 
of the 3 5 cDNA end of the XT-II isoform was carried out with the primers AB102 6 7 5'- 
ACTG AGGTC ACGC AATAC AA-3 ' and UPlfo using the touch-down PCR protocol. For 5'-RACE 

10 1 ng of placenta poly(A) + mRNA was subjected to a reverse transcription and 5' tailing reaction 
with the 5 5 -CDS primer and the SMART oligo (Clontech) according to the manufacturer's 
protocol. The 5 5 end of the XT-I cDNA was amplified using the primer GSP_776L 5'- 
GCCGCACTCAG-GTGATGAAGAAGT-3 ' and UPM with a touch-down PCR protocol. An 
aliquot was used as template in a second semi-nested PCR reaction with the primers GSP_503L 

15 5^ACCA-CCAGGACAAAGGCGATTCTGA-3' and UPM. The 5'-cDNA end of XT-II was 

obtained by 5'-RACE amplification using the primers AB10315L 5 ' -AGTCGAACAGTCCAGG- 
GCCC-3 ' and UPM mix. A new primer for a S'-RACE reaction was designed based upon the 
nucleotide sequence of the largest fragment obtained (2 kbp), and the experiment was repeated 
with the primer AB5846L 5 5 -C ACGATCTCGC ACTTGGGGG-3 5 as described above. The 

20 nucleotide sequences of human XT-I and XT-II cDNA have been submitted to the 
GenBank/EBI Data Bank with the accession numbers AJ277441 and AJ277442. 
Example 21: Isolation ofXT-IcDNA from human brain and chondrocyte cDNA libraries. 
Plasmid-DNA was isolated from a human whole brain cDNA library (Life Technologies) and a 
human chondrocyte cDNA library (Clontech) and used as template in a PCR-based approach 

25 for isolation of the 5' end of the XT-I cDNA. Briefly, GSPJ776L and 5'ADLD 5>- 

CTATTCGATGATGAAGAT ACCCC ACC AAACCC-3 ' primers were used in a PCR reaction with 
1 |xg of plasmid DNA as template. The DNA fragments were subjected to agarose gel 
electrophoresis and 0.7-3 kb long fragments were excised from the gel and used as a template in 
a semi-nested PCR reaction with GSP_503L and S'-ADLD primers. DNA fragments were then 

30 subcloned into the vector pCR2. 1 and sequenced. 

Example 22: Cloning of rat XT-I and XT-II cDNA. A RT-PCR based approach using primers 
based upon the nucleotide sequence of human XT-I and XT-II was employed for amplification 
of XT-I and XT-II cDNA from rat. The first strand of cDNA was synthesized by the reverse 
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transcription reaction using poly(A) + RNA isolated from the rat liver cell line BRL3 A as 
template and oligo(dT) as primer. DNA fragments obtained after PCR amplification using 
moderate stringent conditions were subcloned into the vector pCR2.1 and sequenced. The 5' 
and 3 ' ends of the XT-I and XT-II cDNA from rat tissue were amplified using the RACE 
5 strategy with gene-specific primers as described above. The nucleotide sequences of rat XT-I 
and XT-II cDNA have been submitted to the GenBank/EBI Data Bank with the accession 
numbers AJ295748 and AJ295749. 

Example 23: Expression levels of the XT-I and the XT-II isoform in human tissues. 

A Human Multiple Tissue cDNA Panel (Cfontech) was uised for the analysis of expression 

10 levels. Levels of amplification of the housekeeping gene glyceraldehyde-3-phosphate 

dehydrogenase, whose transcript is always present in the tissues at an almost constant level, 
were determined in parallel for quality control. For amplification of the XT-I encoding 
transcript the primers 128U and 601L were used, whereas primers AB10267U and AB12394L 
5 a -GG AAG AGCTGGGTGTGGAAT-3 ' were employed for the XT-II. PCR reactions were carried 

15 out by 22-36 cycles at 94°C for 30 s, 60-65°C for 45 s and 72°C for 2 min. Amplification of a 
transcript was performed using a serial number of cycles to find the conditions for 
semiquantitative amplification, and aliquots were analyzed by agarose gel electrophoresis. 
Example 24: Transfection and transient expression of XT. For construction of a eukaryotic 
expression vector a DNA fragment including the known coding sequence of XT-I cDNA was 

20 amplified by PCR using XT_ExplL 5 ' -TTTCCCGTTGAGATCCTG CT-3 5 and XT_Exp3U 5'- 
AC AGACAGC AAC A ACGAGAA-3 ' as primers and placenta first-strand cDNA (Clontech) as 
template. The 2450 bp fragment obtained was cloned into the vector pcDNA4/HisMax-TOPO 
(Invitrogen) resulting in the fusion of XT-I to the Xpress epitope. The plasmid pCG227-XT-I 
was then transiently transfected into CHO-K1 cells. The coding region of the XT-II cDNA was 

25 amplified by PCR using the primers AB_Exp9U 5 '-AAAGGAAGGCAGAGGAAGC-3 5 and 
ABJ3xp3L 5'-ACCCCTCCACTGT-CTGTAAG-3' and placenta first-strand cDNA as template. 
The obtained 2440 bp DNA fragment was cloned into the expression vector pcDNA4/HisMax- 
TOPO resulting in the fusion of XT-II to the aminoterminal Xpress epitope. The plasmid 
termed pCG226-XT-II was transiently transfected into CHO-K1 cells. 2xl0 5 cells precultured 

30 for 1 day in a 35 mm diameter cell culture dish were transfected with 2 \ig of plasmid DNA and 
6 \xl of Fugene 6 transfection reagent (Roche, Mannheim, Germany). For determination of 
transfection efficiency CHO-K1 cells were transfected with 2 \ig of the control plasmid 
pcDNA4/HisMax-TOPO-/acZ. 48 h after transfection the cell culture medium was harvested. 
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Protein G agarose beads (Sigma, Deisenhofen, Germany) and mouse anti-Xpress monoclonal 
antibody (Invitrogen) were added to the cell culture supernatant and incubated at 4°C for 1 h. 
After centrifugation at 10,000 g for 1 min the absorbed proteins were twice washed with PBS 
and resuspended in a final volume of 50 XT activity was then assayed in the samples. 

5 Example 25: Characterization of the reaction products. For characterization of the reaction 
product of recombinant XT-I the peptide QEEEGSGGGQK, which is homologous to the amino 
terminus of bikunin (Brinkmann et al 9 1997, La), was used as acceptor in the XT activity 
assay. After incubation for 75 min at 37°C the enzyme was heat-inactivated by incubation for 
15 min at 65°C and the reaction mixture was used for a- and P-xylosidase treatment and 

10 alkaline p-elimination. For the linkage-specific digestion of the bound [ l4 C]-D-xylose 4 mU of 
a-xylosidase (Seikagaku Corporation, Tokyo, Japan) or 4 mU of P-xylosidase (Sigma, 
Dreieich, Germany) were added to the samples and incubated for 60 min at 37°C. The reaction 
mixtures were then placed on Immobilon-AV membrane discs and allowed to dry. The discs 
were washed with trichloroacetic acid as described above and the remaining incorporated 

15 radioactivity was determined against appropriate controls. The alkaline cleavage of the O- 
glycosidic linkage between the [ I4 C]-D-xylose and the P-hydroxyamino acid serine was 
performed as described elsewhere (Montreuil et aL, 1994). Briefly, the reaction mixture was 
adjusted to pH 10 with diluted NaOH and an equal volume of cold, freshly prepared 
sodiumboro-hydride solution (2 M sodiumborohydride in 0.1 M NaOH) and was added After 

20 incubation at 45°C for 1 6 h the cooled solution was neutralized by adding 50% acetic acid and 
placed on Immobilon-AV membrane discs. After drying the discs were washed with 
trichloroacetic acid and the remaining radioactivity was measured by liquid scintillation 
counting. 
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Patent Claims 

1 . An isoform of UDP-D-xyIose:proteoglycan core protein JS-D-xylosyltransferaseQCT). 

5 2. A protein comprising an amino acid sequence of the isoform of claim 1 or a portion thereof, 
having the biological activity of XT. 

3. A protein of claim 1 or 2 deriving from human or rat sources (hXT, rXT) 

10 4, An isolated protein according to claim 3, wherein said hXT has a molecular weight of 
120.000 under SDS polyacrlylamide gel electrophoresis conditions. 

5. A recombinant protein of claim 3, wherein said protein is hXT-I comprising at least 827 
amino acids. 

15 

6. A protein according claim 5, having the amino acid sequence-as depicted in Fig. 7B. 

7. A protein according to claim 3, wherein said protein is hXT-II comprising 865 amino acids 
and is an isoform of hXT-I. 

20 

8. A protein according to claim 7 exhibiting approximately 55% overall sequence identity to 
hXT-I 

9. A protein according claim 8, having the amino acid sequence as depicted in Fig. 8B. 

25 

10. A DNA sequence coding for a protein of any of the claims 1-9. 

1 L A DNA sequence according to claim 10 comprising the nucleotide sequence coding for 
hXT-I as depicted in Fig. 7 A, or rXT-I as depicted in Fig. 9. 

30 

12. A DNA sequence according to claim 10 comprising the nucleotide sequence coding for 
hXT-II as depicted in Fig. 8 A or rXT-II as depicted in Fig. 10. 
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13. An expression vector comprising a promotor sequence, a DNA sequence of claim 10 and 
optionally a signal sequence. 

14. An expression host cell comprising a vector of claim 13, said host cell being capable of 
expressing a protein of any of the claims 1-9. 

15. An antibody directed against a protein as defined in any of the claims 1-9. 

16. A process for isolating and purifying a protein as defined in claims 1-4, characterized by 
the following steps: 

(i) culturing cells having an increased level of XT, and harvesting the 
supernatant of said cell culture, 

(ii) fractionated ammonium sulfate precipitation of the supernatant of step (i), 

(iii) heparin affinity chromatography of the precipitate of step (ii), 

(iv) ion exchange chromatography of the step (iii) product, 

(v) affinity chromatography of the step (iv) product, and 

(vi) SDS-Polyacrylamide gel elektrophoresis of step (v). 

17. A pharmaceutical composition comprising a protein of any of the claims 1-9 and a 
pharmacologically acceptable carrier, diluent or excipient. 

1 8. Use of a protein of any of the claims 1 - 9 for the manufacture of a medicament for the 
treatment of sclerotic diseases and chronic inflammatory joint diseases. 

19. Use of claim 18, wherein said medicament is an inhibitor or antagonist of said protein. 

20. Use of a protein of any of the claims 1 - 9 as diagnostic marker. 

21 . Use of a DNA molecule of any of the claims 10 - 12 as gene marker. 

22. Use of an antibody as defined in claim 15 in an immunological assay for determination of a 
protein having the biological activity of hXT as diagnostic tool. 
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® 



SPPS1 5'-ATH GCI AGY CCI CCI AGY GA-3' 

Peptide 4 lie Ala Ser Pro Pro Ser Asp Ph© Gly Arg 

Inv2b 5 1 -TAD CGN AG I GGN GGI AGI CT-3' 

Inv2c 5' -TAD CGN TCR GGI GGI TCR CT-3' 



DF1 5 1 -GAY TTY GAR AAY GTI GAY AA-3' 

Peptide 6 Asp Phe Glu Asn Val Asp Asn Ser Asn phe Ala Pro Arg 

PFF-sense 5' -CCI ACI TTY TTY GCN CG-3 f 
Peptide 7 Pro Thr Phe Phe Ala Arg 

Peptide 9 Glu Leu Gly Ala Val Lys Pro Asp Gly Arg Leu 

Lysc-invl 3 f -CCN CGI CAN TTY GGN CTR CC-5 1 
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Fia. 7A 

ACTCAGGATGGCTACTTTTCTCATCGGCCGAAAGAGAAAGTGCGAACAGACAGCAACAACGAGAACTCT 

GTCCCCAAAGACT TTGAGAATGT GGAC AACAGCAACTT C GCACCCAGGACTCAAAAGCAGAAGCACCAG 

CCTGAG T T GGCGAAGAAGCCAC CGAGTAGACAGAAGGAGCT T T TGAAAAGGAAGC TGGAACAGCAGGAG 

AAAGGAAAAGGACATACATTCCCTGGGAAAGGCCCCGGTGAGGTGCTGCCTCCCGGGGACAGAGCCGCA 

GCCAACAGCAGCCACGGGAAGGATGTGTCCAGACCGCCTCATGCCAGGAAAACTGGGGGCAGCTCCCCC 

GAGACCAAGTATGACCAGCCCCCTAAGTGTGACATCTCAGGCAAGGAGGCCATCTCTGCCCTGTCCCGT 

GCTAAGTCCAAGCACTGCCGCCAGGAGATTGGGGAGACTTACTGCCGCCACAAGTTAGGGCTGCTGATG 

CCTGAGAAGGTGACTCGGTTCTGCCCCCTCGAGGGTAAAGCCAACAAGAACGTGCAGTGGGACGAGGAC 

TCCGTGGAGTACATGCCAGCCAACCCGGTCAGAATCGCCTTTGTCCTGGTGGTCCACGGCCGTGCCTCT 

CGGCAGTTGCAGCGCATGTTCAAGGCCATCTACCACAAAGACCACTTCTACTACATCCACGTGGACAAG 

CGCTCTAATTACCTGCATCGGCAAGTGCTCCAGGTCTCCAGGCAGTACAGCAATGTCCGCGTCACCCCC 

TGGAGAATGGCCACCATCTGGGGAGGAGCCAGCCTCCTGTCCACCTACCTGCAGAGCATGCGGGACCTC 

CTGGAGATGACCGACTGGCCCTGGGACTTCTTCATCAACCTGAGTGCGGCCGACTACCCCATCAGGACA 

AATGACCAGTTGGTGGCGTTTCTCTCCCGATACCGAGATATGAATTTCTTGAAGTCACACGGCCGGGAC 

AATGCAAGGTTCATTCGGAAGCAGGGCCTGGATCGGCTCTTCCTGGAGTGCGACGCTCACATGTGGCGC 

CTGGGAGATCGGCGGATCCCAGAGGGCATTGCCGTGGATGGCGGTTCGGACTGGTTCCTGCTGAACCGG 

AGGTTTGTAGAATATGTGACCTTCTCCACAGACGATCTGGTGACCAAGATGAAACAGTTCTACTCCTAC 

ACCCTGCTTCCTGCTGAGTCCTTCTTCCATACGGTCCTGGAGAACAGCCCCCACTGCGACACCATGGTG 

GACAACAACCTGCGCATCACCAACTGGAATCGCAAGCTGGGCTGCAAGTGCCAGTACAAGCACATCGTG 

GACTGGTGCGGCTGCTCCCCCAATGACTTCAAGCCGCAGGACTTCCACCGCTTCCAGCAGACAGCCCGG 

CCTACCTTCTTTGCCCGCAAGTTTGAAGCCGTGGTGAATCAGGAAATCATTGGGCAGCTGGACTATTAC 

CTGTACGGGAACTACCCTGCAGGTACCCCGGGCCTGCGCTCCTACTGGGAGAATGTCTACGATGAGCCT 

GACGGCATCCACAGCCTGAGCGACGTGACACTCACCTTGTACCACTCCTTTGCCCGCCTGGGTCTTCGA 

CGGGCTGAGACGTCCCTGCACACGGATGGGGAGAACAGCTGCCGATACTACCCAATGGGCCACCCAGCA 

TCTGTGCACCTCTACTTCCTTGCTGACCGCTTCCAGGGCTTTCTGATCAAGCATCATGCTACCAATCTG 

GCTGTGAGCAAACTAGAGACTCTGGAGACCTGGGTGATGCCGAAAAAAGTCTTCAAGATCGCAAGCCCA 

CCCAGTGACTTTGGGAGGCTTCAGTTTTCCGAGGTCGGCACTGACTGGGATGCCAAGGAGAGGCTATTC 

CGCAACTTTGGGGGTCTTCTGGGGCCCATGGATGAGCCGGTGGGTATGCAGAAGTGGGGGAAGGGACCT 

AATGTGACCGTGACCGTCATTTGGGTGGATCCCGTCAATGTCATCGCAGCCACCTACGACATCCTCATT 

GAGTCCACTGCCGAATTCACACACTACAAGCCCCCTTTGAACTTGCCCCTGAGGCCTGGGGTCTGGACA 

GTGAAAATTCTCCACCACTGGGTGCCAGTTGCAGAGACCAAATTCCTCGTTGCGCCTCTGACCTTCTCG 

AACAGGCAGCCCATCAAACCTGAGGAGGCACTGAAGCTGCACAATGGGCCCCTCCGCAATGCCTACATG 

GAGCAGAGCTTCCAGAGCCT2\AACCCCGTCCTCAGCCTGCCCATCAACCCCGCCCAGGTGGAACAGGCA 

CGGAGGAACGCAGCCTCCACGGGCACAGCGCTGGAGGGATGGCTGGACTCGTTGGTGGGCGGGATGTGG 

ACTGCCATGGACATCTGTGCCACGGGCCCCACAGCCTGCCCGGTCATGCAGACCTGCAGCCAGACGGCC 

TGGAGCTCCTTCAGCCCTGACCCCAAGTCGGAGCTGGGTGCCGTCAAACCTGATGGCCGGCTCAGGTAG 

CACTGGGCACGAGGAGTGGGCCACAGCAGGATCTCAACGGGAAAGCAGCCAGAGGGGTTGTGGGGCCTG 

AACCCCGGCCTCCACCCTGGGGGAGGCCCTCTGTGAATGGGTCTCTCCTGGCCATAGAATGATGGAAAG 

GGAAGGTCAGCAGGTCAAAGCAGGATCAGCCAACAACCTGCCTTTGGCAAGCTGCGGGTGGGATGGCTC 

AGTCCCTGCACTGTGACTGTCTCACCTCTTCTGSTTGATCCTCAAGTCCTACAGGTTCCTTGTCTTCCC 

CTTCCAGTGACCCACCCCTGACCCCAGACGTGTGATTTTCAGACTTTTCTTTCGAGCAGCAGAACTTCG 

TTTACGGAGCACAGTCATAAGTGRAGGTTCAGGGTGCTGACGAAATCCAAGCTGCTCTGGTTGAAGCTG 

ACAAGTGCGAGGTTCCCTCCCAAAGCTCAGCCCTCTGGGCGGTCCCCTTGCCCAGGGTATCTCCTACGG 

TACCTCTTCAGAACCCAAGGGCTCTGCAAATGCCAGTTTGACAAGCACTGCCCAGACCAACCATGGGTT 

CAAACTCCAGCCCTGCCCTTTGGTTCATTTTTCTGCTTCTCTTGGCTGGGGGACTCTGGTGCCAGCCTT 

GAAAGTCATGGTCGTGGGCCCTTTCCCATGGAGGCTGCAGCCTTAGGAGAGCTCTGAGCCTCTCAGCAG 

CCCTCCTTGGGTTGAACTATTCTCCTTAGTAACTAGGTAAGTGGGAAAGCCTTTTGATGTGGCATGGCC 

AAGGTCCAGCCACAAGTGCAACTGCCACCTGTCCAGGGGTCTGGGCCTCCTTCCCTCAAGGCTGCCACA 

CAAAGTAGCAGAAAT AGGAT GATGT TTGT GAGC ACCAGACTCAAGACC AT GACCT TCT T T GAT CC T T GA 

AAAT GGGAACTT TGACAGCCATGACCAT GAAAC T CACAAGGCAACGCGGAT GAAACTC ACAAAGCAATG 

CTTGGAGCAAAACTCCTGAGCTAGACAGCACAGCAGCACCCATCCCCTGCCAGAGCCCTTCCGTTTGAG 

GT CAGACAC ACAAAACCT T CGTCAAT TGCACACC GGTGCTGTTGGGAGT GACCAAACCACATGAACCAG 

ACTTTTCCCGTCCAGGAAATAGCATTTCAGATTTGGTTTTTAATTTCATGCCCTTCGGCCACAGGCTCA 

ACGGGACATGCAACATAAAAATGGGAAGGTTATTCAACCCGCGGGAAAAAAAAAA2\AAAAAAAAAAAAA 
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Fig.7B 
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Fig.8A 

CGCAGGCCCCGCCCCCGGCCCCGCCCCGGCGCGCGCCCGGCCCGCCTCCCTAGGCGTGGAGGAGGGGGGG 

CGGCTCAGCCCCGCGCCCGTGCGCGCGCTCGCGGCCGGGTTGCAGGGCTGGGCGCGCGCCCCGCGTCCCG 

GGCAGGAAGATGGTGGCGAGCGCGCGAGTGCAGAAGCTGGTGCGGCGCTACAAGCTGGCGATTGCCACGG 

CGCTGGCCATCCTGCTGCTGCAGGGCCTGGTAGTGTGGAGCTTCAGCGGCCTGGAGGAGGACGAGGCGGG 

C GAGAAAGGAAGGCAGAG GAAGC CAC GGCC AC TGGACCC TGGCGAAGGTTCCAAGGACACAGACAGTTC A 

GCAGGGCGACGGGGCAGCACAGGCAGAAGGCATGGGCGCTGGCGGGGCCGTGCTGAGAGCCCAGGAGTGC 

CCGTGGCCAAGGTGGTACGGGCAGTAACCAGCCGGCAGAGAGCCAGCCGGCGGGTCCCACCTGCCCCACC 

TCCGGAAGCCCCAGGCCGCCAGAACCTGAGTGGGGCAGCAGCTGGGGAGGCGCTGGTAGGGGCAGCTGGC 

TTCCCACCACACGGAGATACAGGGAGCGTGGAGGGCGCCCCCCAGCCCACGGACAATGGCTTCACCCCCA 

AGTGCGAGATCGTGGGCAAGGACGCACTGTCTGCACTGGCCCGGGCCAGCACCAAGCAGTGCCAGCAGGA 

GATCGCCAATGTGGTGTGCCTGCACCAGGCTGGGAGCCTCATGCCCAAGGCTGTGCCCCGGCACTGTCAG 

CTGACTGGGAAGATGAGCCCCGGCATCCAGTGGGATGAGAGCCAAGCCCAGCAGCCCATGGATGGCCCCC 

CGGTGCGAATCGCCTACATGCTGGTGGTTCACGGCCGCGCCATCCGCCAGCTGAAGCGTCTCCTCAAGGC 

CGTTTATCACGAGCAGCACTTCTTTTACATCCATGTGGACAAGCGTTCCGACTACCTGCACCGGGAGGTG 

GTGGAGCTGGCCCAGGGCTATGATAACGTGCGGGTGACGCCCTGGCGCATGGTTACCATCTGGGGCGGGG 

CCAGCCTCCTGACGATGTACCTGCGGAGCATGCGGGACCTGCTAGAGGTGCCTGGCTGGGCCTGGGACTT 

CT TCAT CAACCT CAGTGCCACTGACTAT CCAACCAGGACCAAT GAGGAGCT GGT GGCAT TCCTAT CCAAG 

AACCGGGACAAGAATTTCCTCAAGTCACATGGCCGGGACAACTCCAGGTTCATCAAGAAACAGGGCCTGG 

ACCGGCTCTTCCATGAGTGCGACTCACACATGTGGCGCCTGGGCGAGCGGCAGATCCCAGCAGGCATTGT 

GGTGGATGGCGGTTCTGACTGGTTCGTGCTGACACGCAGCTTTGTGGAGTATGTGGTGTACACAGATGAC 

CCGCTTGTGGCCCAGCTGCGCCAGTTCTACACATACACACTGCTCCCAGCCGAGTCCTTCTTCCACACGG 

TGCTGGAGAACAGCCTGGCCTGTGAGACCCTCGTGGACAACAACCTGCGGGTCACCAACTGGAACCGCAA 

GCTGGGCTGCAAGTGCCAGTACAAGCACATTGTGGACTGGTGTGGCTGCTCCCCCAACGACTTCAAGCCA 

CAGGACTTCCTCCGGCTGCAGCAAGTCTCCAGACCCACCTTCTTCGCCCGGAAGTTCGAGTCGACTGTGA 

ACCAGGAGGTGCTGGAAATCCTGGACTTCCACCTGTACGGCAGCTACCCCCCCGGCACGCCAGCCCTCAA 

GGCCTACTGGGAGAACACCTACGACGCGGCTGATGGCCCCAGTGGGCTCAGTGATGTCATGCTCACTGCT 

TACACAGCCTTCGCCCGCCTCAGCCTGCACCATGCCGCCACTGCTGCACCCCCAATGGGCACCCCACTCT 

GCAGGTTTGAGCCCAGGGGCTTGCCGTCCAGCGTGCACCTGTATTTCTATGACGACCATTTCCAGGGCTA 

CCTGGTGACGCAGGCGGTGCAGCCCTCAGCCCAGGGGCCGGCAGAGACGCTTGAGATGTGGCTGATGCCC 

CAAGGGTCGCTGAAGCTGTTGGGGCGCAGTGACCAGGCCAGCCGGCTCCAGAGTCTGGAGGTTGGCACTG 

ATTGGGACCCCAAAGAGCGTCTTTTCCGGAACTTTGGGGGGTTACTGGGGCCGCTGGACGAGCCTGTGGC 

CGTGCAGCGCTGGGCCCGGGGCCCCAACCTCACAGCCACAGTGGTCTGGATCGACCCAACCTATGTGGTG 

GCCACATCTT ATGAC AT CAC AGTAGATACGGAGAC T GAGGTCACGCAAT ACAAGCCCCCACT GAG CC GGC 

CCCTGCGGCCAGGGCCCTGGACTGTTCGACTCCTTCAGTTCTGGGAACCGCTGGGTGAGACCCGCTTCCT 

TGTGCTGCCCTTGACCTTCAACCGCAAACTACCTCTCAGGAAAGATGATGCCAGCTGGCTGCACGCAGGG 

CCACCCCACAACGAGTACATGGAGCAGAGTTTCCAGGGCCTGAGTAGCATCCTGAACCTGCCTCAGCCGG 

AGCTCGCGGAGGAGGCTGCCCAGCGGCACACACAGCTCACAGGCCCTGCGCTCGAGGCCTGGACAGACAG 

GGAACTGAGCAGCTTCTGGTCCGTGGCTGGACTGTGTGCCATAGGCCCCTCTCCCTGCCCCTCCCTGGAG 

CCCTGCAGACTGACCAGCTGGAGCTCTCTGTCCCCCGACCCCAAATCAGAGCTGGGGCCTGTCAAAGCAG 

ACGGGCGACTCAGGTAGCAGGGCCCCAGCCAGTACCCGTGGAGGACCCGGGAAATTGCACCTTACAGACA 

GTGGAGGGGTGTCCCCTCCCACAGGCAAGAACCAGAGGCCCAGGCTGCACACCCATTTCAGCCATCAAGA 

ACCCAC ACAGACGGC AGGGAAGGTGGACACAGTATGAACTACT GC TGAT GT C T C T GT TGGG GATC AGAGG 

GCTGGCGGGAACGCGAGAAGGGCACCAGCAGCATTCCACACCCAGCTCTTCCTCACCTTCCTGTCTAGTT 

TGAATTTCTTTTTTTTCTTTCTTTTTTTTTTTTTTTTAATTTAAAAAGGAAAATGGGTGGTTGGGAGAGA 

AACTAGAACAGAAGAT GTGCAATAGGGC TCAGAGCAGCCCCAGGAAGTGGGC C ATGT CT GTGGGGAGCAG 

TGGCTTGCTGAGCCCACCTGCTATTGTTCTGCACGAGGCTGGGCCTGCCTCACTCTCCAGGGCCTCATGC 

CCCATTCTGGGCCTGTGGTGCTCGTGGCTGAGGCTCCACAGGGCTGCAAGTGCCCTGCCAGGCTCTAAGG 

CCCGAAGAACAGGTGATATCGGGGGCGCTGCAGAGCTGGGCTCTAGCAGGACAGTTCTTTTGTAGCCAGG 

GGACCCTAGAAGTGGGGTGGGGCGGCTCCAGGGCTTTCCAGCATACCCTGCCCCTGGATGGGAAAGGCAG 

GGTCAGGGCCCACTGAGACCCATGCAGAGTTTCCAGGTGTGCACTGGAAGTGAGGTCACATGAGCAGCGT 

GGGAAGAAGACTCTGTCAAGACTCTCAGAAGAACCTGGAGTAATTGTGCCTGAAGCTCAGCGTGAAGTCT 

GAAATATGCAACAGAAGAAATATATCTCTATCTCTCTA 
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fig-8B 
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Gly Pro &§n Leu Thr Ala Thr Val Val Trp lie Asp Pro Thr Tyr Val Val 
Ala Thr Ser Tyr Asp He Thr Val Asp Thr Glu Thr Glu Val Thr Gin Tyr 
Lys Pro Pro Leu Ser Arg Pro Leu Arg Pro Gly Pro Trp Thr Val Arg Leu 
Leu Gin Phe Trp Glu Pro Leu Gly Glu Thr Arg Phe Leu Val Leu Pro Leu 
Thr Phe Asn Arg Lys Leu Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His 
Ala Gly Pro Pro His Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser 
Ser He Leu Asn Leu Pro Gin Pro Glu Leu Ala Glu Glu Ala Ala Gin Arg 
His Thr Gin Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Arg Glu Leu 
Ser Ser Phe Trp Ser Val Ala Gly Leu Cys Ala lie Gly Pro Ser Pro Cys 
Pro Ser Leu Glu Pro Cys Arg Leu Thr Ser Trp Ser Ser Leu Ser Pro Asp 
Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu Arg 
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Fig. 9 



TACTTCTCTCACCGGCCCAAAGAGAAAGTTCGGACCGACAGCAACAATGAGAACTCAGTCCCCAA 
GGACTTTGAGAATGTGGACAACAGCAACTTCGCACCCAGGACTCAAAAGCAGAAGCACCAGCCAG 
AGTTGGCAAAGAAGCCCCTCAGCAGGCAGAAAGAGCGTTTGCAGAGAAAGCTGGGCGCCCAGGAC 
AAAGGACAGGGGCAGTCAGTCCTAGGAAAAGGCCCCAAGGAGGTGCTGCCTCCTCGGGAAAAAGC 
CCCAGGCAACAGTAGCCAAGGGAAGGATCTCTCAAGACACAGCCATAGCAGGAAGAGTGGTGGCG 
GTGGGTCCCCGGAAACTAAGTCTGACCAGGTCCCCAAGTGTGACATCTCTGGCAAGGAGGCCATC 
TCAGCGCTGACCCGCGCTAAGTCCAAGCACTGTCGCCAGGAGATTGCAGAAACCTACTGTCGCCA 
CAAGCTGGGGCTGCTGATGCCAGAGAAGGTGGCTCGATTCTGTCCCCTGGAAGGCAAAGCCAACA 
AGAATGTCCAGTGGGATGAGGATGCTGTTGAGTACATGCCCCCCAACCCAGTCAGAATCGCCTTT 
GTCCTGGTGGTCCATGGCCGTGCCTCTCGACAGCTGCAGCGCATGTTTAAGGCCATCTACCACAA 
AGACCATTTCTACTATATCCATGTGGATAAGCGTTCCAATTACCTGCATCGGCAAGTGCTCCAGT 
TCTCCAGGCAGTACGACAACGTCCGAGTCACTTCCTGGAGGATGGCCACCATTTGGGGCGGAGCC 
AGCCTCCTGTCCACTTACCTGCAGAGCATGCGGGATCTACTGGAGATGACTGACTGGCCCTGGGA 
CTTCTTCATCAACCTGAGTGCGGCTGACTACCCCATCAGGACAAATGACCAGCTCGTAGCATTTC 
TTTCCAGATACCGAGATATGAATTTCCTGAAGTCACATGGTCGGGACAATGCAAGGTTCATCCGG 
AAGCAGGACCTGGACCGTCTCTTCCTGGAGTGTGATACACACATGTGGCGCCTGGGGGACCGGCG 
AATCCCAGAGGGCATTGCTGTAGATGGCGGCTCTGATTGGTTCCTGCTAAACCGGAAGTTTGTAG 
AGTATGTGGCATTCTCCACAGATGACCTGGTGACCAAGATGAAGCAGTTCTACTCTTATACCCTC 
CTCCCTGCTGAGTCCTTTTTCCACACGGTCCTGGAGAACAGCCCCCACTGTGACACCATGGTGGA 
TAACAACCTGCGCATCACCAATTGGAACCGCAAGCTGGGCTGCAAGTGCCAGTACAAGCATATTG 
TGGACTGGTGTGGCTGCTCTCCCAATGACTTCAAGCCTCAGGACTTCCATCGCTTCCAGCAGACG 
GCCCGGCCCACCTTCTTTGCCCGAAAGTTCGAAGCCATAGTGAACCAGGA7VATCATTGGGCAGTT 
GGACTCTTACCTGTATGGCAACTATCCTGCGGGCACCCCGGGCCTCCGCTCTTACTGGGAGAATG 
TCTATGACGAACCAGATGGCATCCAGAGCCTCAGCGATGTGGCACTCACCATGTATCATTCCTTC 
ATCCGCCTGGGTCTTCGAAGGGCTGAGTCATCGCTACACACGGATGGGGAGAACAGCTGCAGGTA 
CTATCCAATGGGCCACCCAGCGTCTGTCCACCTCTACTTCCTGGCTGACCGATTCCAGGGCTTTC 
TGATCAAGCACCATGTGACCAACCTTGCTGTGAGCAAACTGGAGACACTGGAGAGATGGATGATG 
CCAAAGAAAGTCTTCAAGGTCGCAAGTCCCCCCAGTGACTTTGGGAGGCTTCAGTTTTCTGAGGT 
TGGCACTGACTGGGATGCCAAGGAGAGACTATTCCGGAACTTTGGTGGTCTTTTGGGGCCCATGG 
ATGAGCCGGTGGGAATGCAGAAATGGGGGAAGGGACCCAATGTGACCGTGACTGTTATTTGGGTG 
GATCCTGTCAATGTCATCGCAGCCACCTATGATATCCTGATCGAGTCCACTGCGGAATTCACACA 
CTACAAACCCCCTTTGAATCTGCCTCTGAGGCCTGGTGTCTGGACAGTGAAGATTCTCCATCACT 
GGGTACCAGTTGCAGAGACCAAATTCCTTGTCGCACCTCTGACCTTCTCCAACAAGCAGCCCATC 
7^AACCAGAGGAGGCTTTGAAGCTGCACAATGGGCCACCTCGCAGTGCCTACATGGAGCAGAGTTT 
CCAGAGCCTAAACCCAGTCCTCAGCCTGCACATCAATCCTGCCCAAGTGGAGCAGGCCCGGAAGA 
ACGCAGCCTTCACTGGGACAGCACTAGAAGCCTGGCTGGTGGGTGGGACTTGGACTGCCATGGAC 
GTCTGTGCCACAGGCCCCACTGCCTGCCCAGTCATGCAAACCTGCAGCCAAACAGCCTGGAGTTC 
CTTCAGCCCTGATCCCAAGTCAGAGCTGGGTGCAGTGAAGCCTGATGGTCGGCTCAGGTAGCACT 
GGCCACATGGGACTGATACAGCCTGAACCCTGGCCTCTGATCTTATATGGGGGAGGGGTTCCTTC 
TGTTCAAGGACCTCTGTTGCCTGTAGGACAGTGGAAAGAGAGGATTTGGAGGCCAGA 
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GCGCTCGCGGCCGGGTTGCAGGGCCGGACGCGCGCCCCGCGTCCCGGGCAGGAAGATGGTGGCGA 
GCGCGCGGGTGCAGAAGCTGGTGCGGCGCTACAAGCTGGCGATCGCCACCGCGCTGGCCATCCTG 
CTGCTGCAGGGCCTGGTGGTGTGGAGCTTCAGCGGCCTGGAGGAGGACGAGCCGGGCGAGAAAGG 
AAGGCAGAGAAAGCCACGGCCGCTAGACCCTGGAGAGGGTTCCAAGGACACGGACAGCTCTGCAG 
GGCGTAGGGGCAGTGCTGGGAGAAGGCACGGACGGTGGCGGGGCCGCGCCGAGAGCCCAGGTGTG 
CCTGTGGCCAAGGTGGTGCGGGCAGTAACCAGTCGGCAGAGAGCCAGCCGACGGGTGCCCCCTGC 
CCCACCTCCAGAGGCACCAGGCCGCCAGAACCTGAGTGGAGCAGCAGCCGGGGAAGCACTGATAG 
GTGCCGCCGGCTTCCCACAACATGGAGACACAGGGAGTGTGGAGGGTGCCCCTCAGCCCACGGAT 
AACAGCTTCACTCCGAAGTGTGAGATTGTGGGCAAGGATGCACTGTCAGCACTGGCCCGGGCCAG 
CACCAAGCATTGTCAACAAGAGATTGCTAATGTAGTGTGCCTGCACCAAGCTGGGAACCTAATGC 
CCAAGTCTGTGCCCCGGCACTGCCAGCTGGCTGGCAAGGTGAGCCCTGGCATCCAGTGGGAAGAG 
GTCCGGGCCCAGCAGCCTGTGAGTGGCCCTCTGGTACGCATCGCCTACATGCTGGTGGTTCACGG 
CCGTGCTGTGCGCCAGCTGAAGCGTCTTCTGAAGGCCGTCTACCACGAGGAGCACTTCTTTTATA 
TTCATGTGGACAAGCGTTCCAACTACCTGTACCGGGAGGTGGTAGAGCTGGCCCAGCACTACGAC 
AATGTACGGGTAACACCTTGGCGCATGGTCACCATCTGGGGTGGGGCTAGCCTTCTGAGGATGTA 
CCTGCGGAGCATGAAGGACCTTCTGGAGACTCCTGGCTGGACCTGGGACTTCTTCATCAACCTGA 
GCGCTACTGACTATCCAACCAGGACGAATGAGGAGCTGGTAGCGTTCTTATCCAAGAACCGGGAC 
AAGAATTTCCTCAAGTCACACGGGCGAGACAATTCCAGGTTCATCAAGAAACAAGGCCTGGACCG 
GCTTTTCCATGAGTGTGATTCCCACATGTGGCGCCTGGGTGAACGGCAGATCCCGGCAGGCATCG 
TGGTGGATGGTGGCTCTGACTGGTTCGTGCTGACACGCAGCTTTGTGGAATATGTGGTGTATACA 
GAGGATCCTCTGGTGGCCCAGCTTCGCCAGTTCTATACATACACATTGCTTCCAGCCGAGTCCTT 
CTTCCACACAGTGCTGGAGAACAGCCCAGCCTGTGAGAGCCTAGTGGACAACAACCTGCGGGTTA 
CCAACTGGAACCGGAAGCTGGGCTGCAAGTGCCAGTACAAGCACATCGTGGACTGGTGTGGCTGC 
TCCCCCAACGACTTCAAGCCACAGGACTTCCTGCGGCTTCAGCAAGTCTCCAGACCCACCTTCTT 
TGCCCGGAAGTTTGAGTCGACTGTGAACCAGGAAGTCCTGGAAATTTTGGACTTCCACCTGTATG 
GCAGCTACCCACCCGGCACCCCAGCCCTCAAGGCCTACTGGGAGAACATCTACGACATGGCCGAT 
GGCCCTAGTGGACTCAGCGATGTCCTACTCACTGCTTACACAGCCTTTGCCCGTATCAGTCTGCG 
TCATGCTGCCACTGTTTCCCCACTGGCCACTGCAGTCTGCAGGTTTGAGCCCAGGGGGTTGCCGT 
CCAGCGTGCACCTGTATTTCTATGACGACCATTTCCAGGGCTACCTGGTGACGCAGGCAGTGCAG 
CCCTCAGCCCAGGGGCCAGCAGAGACACTTGAGATGTGGCTGATGCCCCAGAGGTTGCTGAAGCC 
GTTGGGGCACAGTGACCAGGCCAGCCGGCTCCAGAGTCTGGAGGTTGGCACTGAGTGGGACCCCA 
AAGAACGTCTCTTCCGGAACTTTGGGGGCCTGTTGGGACCACTGGATGAACCTGTGGCCATGCAG 
CGCTGGGCCCGGGGCCCCAACCTCACAGCCACTGTGGTCTGGATTGACCCCACCTATGTTGTGGC 
CACATCCTATGACATCACGGTAGATGCGGACACTGAAGTCACGCAGTACAAGCCCCCACTGAGCC 
TGCCACTGCGGCCAGGAGCCTGGACTGTTCGATTGCTTCAGTTCTGGGAGCCCCTGGGTGAGACC 
CGCTTCCTCGTGCTGCCATTGACCTTCAACCACAAACTACCTCTCAGGAAAGATGATGCCAGCTG 
GCTGCATGCGGGACCACCCCACAACGAATACATGGAACAGAGTTTCCAGGGACTAAGTGGCATCC 
TGAATCTGCCTCAGCCAGAGGCCGTGGAGGAGGCTGCCCGGCGGCACACAGAGCTCACAGGTCCT 
GCACTTGAGGCCTGGACAGATGGGGAACTGAGCAGTTTCTGGTCTGTTGCAGGATTGTGTGCCAT 
AGGGCCTTCTTCTTGTCCCTCCCTGGAGCTCTGCAGACTGACCAGCTGGAGCTCTCTGTCTCCTG 
ACCCCAAGTCAGAGCTGGGGCCTGTCAAAGCTGACGGACGACTCAGGTAGCAGGACCCCAGCCAG 
CACAACCCGGAGGAGCCGGGGAATTGCACCTTACAGACAATGGAGGGACGTCTCTCCTCTGGTAG 
CCAGGGAGCCCTAGAACAGTGGTGGGGCTGTCTGTCTGTCAGGCTTTCCTAGGAAATGGACGACA 
TCAGGCCCTGCTGAGATCCATGAAGTTTCCCCACTGGAGGAGGGAGCCCCACTGGCAGCACAG 
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SEQUENCE LISTING 

<110> Kleesiek, Knut 

<120> Xylosyltransferase and isoforms thereof 

<130> Xylosyltransferase 

<140> 
<141> 

<160> 36 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 3726 

<212> DNA 

<213> Homo sapiens 

<220> " 

<221> CDS 

<222> (1) . . (2484) 

<223> human XT-I (not completely) 

<220> 

<221> misc_binding 
<222> (544) . . (552) 

<223> binding motif for nucleotide-sugars 
<220> 

<221> misc_feature 
<222> (280) . . (282) 
<223> N-glycosylation site 

<220> 

<221> misc_feature 
<222> (865) . . (867) 
<223> N-glycosylation site 

<220> 

<221> misc_f eature 
<222> (1933) . . (1935) 
<223> N-glycosylation site 

<400> 1 

act cag gat ggc tac ttt tct cat egg ccg aaa gag aaa gtg cga aca 48 

Thr Gin Asp Gly Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr 
15 10 15 

gac age aac aac gag aac tct gtc ccc aaa gac ttt gag aat gtg gac 96 
Asp Ser Asn Asn Glu Asn Ser Val Pro Lys Asp. Phe Glu Asn Val Asp 
20 25 30 

aac age aac ttc gca ccc agg act caa aag cag aag cac cag cct gag 144 
Asn Ser Asn Phe Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu 
35 40 45 



ttg gcg aag aag cca ccg agt aga cag aag gag ctt ttg aaa agg aag 
Leu Ala Lys Lys Pro Pro Ser Arg Gin Lys Glu Leu Leu Lys Arg Lys 
50 55 60 



192 
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ctg gaa cag cag gag aaa gga aaa gga cat aca ttc cct ggg aaa ggc 240 
Leu Glu Gin Gin Glu Lys Gly Lys Gly His Thr Phe Pro Gly Lys Gly 
65 70 75 80 

ccc ggt gag gtg ctg cct ccc ggg gac aga gcc gca gcc aac age age 288 
Pro Gly Glu Val Leu Pro Pro Gly Asp Arg Ala Ala Ala Asn Ser Ser 
85 90 95 

cac ggg aag gat gtg tec aga ccg cct cat gcc agg aaa act ggg ggc 336 
His Gly Lys Asp Val Ser Arg Pro Pro His Ala Arg Lys Thr Gly Gly 
100 105 110 

age tec ccc gag acc aag tat gac cag ccc cct aag tgt gac ate tea 384 
Ser Ser Pro Glu Thr Lys Tyr Asp Gin Pro Pro Lys Cys Asp He Ser 
115 120 125 

ggc aag gag gcc ate tct gcc ctg tec cgt get aag tec aag cac tgc 432 
Gly Lys Glu Ala He Ser Ala Leu Ser Arg Ala .Lys Ser Lys His Cys 
1*30 135 140 

cgc cag gag att ggg gag act tac tgc cgc cac aag tta ggg ctg ctg 480 
Arg Gin Glu He Gly Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu 
145 150 155 160 

atg cct gag aag gtg act egg ttc tgc ccc etc gag ggt aaa gcc aac 528 
Met Pro Glu Lys Val Thr Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn 
165 170 175 

aag aac gtg cag tgg gac gag gac tec gtg gag tac atg cca gcc aac 576 
Lys Asn Val Gin Trp Asp Glu Asp Ser Val Glu Tyr Met Pro Ala Asn 
180 185 190 

ccg gtc aga ate gcc ttt gtc ctg gtg gtc cac ggc cgt gcc tct egg 624 
Pro Val Arg He Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg 
195 200 205 

cag ttg cag cgc atg ttc aag gcc ate tac cac aaa gac cac ttc tac 672 
Gin Leu Gin Arg Met Phe Lys Ala He Tyr His Lys Asp His Phe Tyr 
210 215 220 

tac ate cac gtg gac aag cgc tct aat tac ctg cat egg caa gtg etc 720 
Tyr He His Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu 
225 230 235 240 

cag gtc tec agg cag tac age aat gtc cgc gtc acc ccc tgg aga atg 768 
Gin Val Ser Arg Gin Tyr Ser Asn Val Arg Val Thr Pro Trp Arg Met 
245 250 255 

gcc acc ate tgg gga gga gcc age etc ctg tec acc tac ctg cag age 816 
Ala Thr He Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser 
260 265 270 

atg egg gac etc ctg gag atg acc gac tgg ccc tgg gac ttc ttc ate 864 
Met Arg Asp Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe He 
275 280 285 

aac ctg agt gcg gcc gac tac ccc ate agg aca aat gac cag ttg gtg 912 
Asn Leu Ser Ala Ala Asp Tyr Pro He Arg Thr Asn Asp Gin Leu Val 
290 295 300 

gcg ttt etc tec cga tac cga gat atg aat ttc ttg aag tea cac ggc 960 
Ala Phe Leu Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly 
305 310 315 320 
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egg gac aat gca agg ttc att egg aag cag ggc ctg gat egg etc ttc 1008 
Arg Asp Asn Ala Arg Phe He Arg Lys Gin Gly Leu Asp Arg Leu Phe 
325 330 335 

ctg gag tgc gac get cac atg tgg cgc ctg gga gat egg egg ate cca 1056 
Leu Glu Cys Asp Ala His Met Trp Arg Leu Gly Asp Arg Arg He Pro 
340 345 350 

gag ggc att gec gtg gat ggc ggt teg gac tgg ttc ctg ctg aac egg 1104 
Glu Gly He Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg 
355 360 365 

agg ttt gta gaa tat gtg acc ttc tec aca gac gat ctg gtg ace aag 1152 
Arg, Phe Val Glu Tyr Val Thr Phe Ser Thr Asp Asp Leu Val Thr Lys 
370 375 380 

atg aaa cag ttc tac tec tac acc ctg ctt cct get gag tec ttc ttc 1200 
Met Lys Gin Phe Tyr Ser Tyr Thr Ifeu Leu Pro Ala Glu Ser Phe Phe 
385 ' 390 395 400 

cat acg gtc ctg gag aac age ccc cac tgc gac acc atg gtg gac aac 1248 
His Thr Val Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn 
405 410 415 

aac ctg cgc ate acc aac tgg aat cgc aag ctg ggc tgc aag tgc cag 1296 
Asn Leu Arg He Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin 
420 425 430 

tac aag cac ate gtg gac tgg tgc ggc tgc tec ccc aat gac ttc aag 1344 
Tyr Lys His He Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys 
435 440 445 

ccg cag gac ttc cac cgc ttc cag cag aca gee egg cct acc ttc ttt 1392 
Pro Gin Asp Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe 
450 455 460 

gec cgc aag ttt gaa gee gtg gtg aat cag gaa ate att ggg cag ctg 1440 
Ala Arg Lys Phe Glu Ala Val Val Asn Gin Glu He He Gly Gin Leu 
465 470 475 480 

gac tat tac ctg tac ggg aac tac cct gca ggt acc ccg ggc ctg cgc 1488 
Asp Tyr Tyr Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg 
485 490 495 

tec tac tgg gag aat gtc tac gat gag cct gac ggc ate cac age ctg 1536 
Ser Tyr Trp Glu Asn Val Tyr Asp Glu Pro Asp Gly He His Ser Leu 
500 505 510 

age gac gtg aca etc acc ttg tac cac tec ttt gee cgc ctg ggt ctt 1584 
Ser Asp Val Thr Leu Thr Leu Tyr His Ser Phe Ala Arg Leu Gly Leu 
515 520 525 

cga egg get gag acg tec ctg cac acg gat ggg gag aac age tgc cga 1632 
Arg Arg Ala Glu Thr Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg 
530 535 540 

tac tac cca atg ggc cac cca gca tct gtg cac etc tac ttc ctt get 1680 
Tyr Tyr Pro Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala 
545 550 555 560 

gac cgc ttc cag ggc ttt ctg ate aag cat cat get acc aat ctg get 1728 
Asp Arg Phe Gin Gly Phe Leu lie Lys His His Ala Thr Asn Leu Ala 
565 570 575 
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gtg age aaa eta gag act ctg gag acc tgg gtg atg ccg aaa aaa gtc 1776 

Val Ser Lys Leu Glu Thr Leu Glu Thr Trp Val Met Pro Lys Lys Val 
580 585 590 

ttc aag ate gca age cca ccc agt gac ttt ggg agg ctt cag ttt tec 1824 
Phe Lys He Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser 
595 600 605 

gag gtc ggc act gac tgg gat gee aag gag agg eta ttc cgc aac ttt 1872 
Glu Val Gly Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe 
610 615 620 

ggg ggt ctt ctg ggg ccc atg gat gag ccg gtg ggt atg cag aag tgg 1920 
Gly Gly Leu Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp 
625 630 635 640 

ggg aag gga cct aat gtg acc gtg acc gtc att tgg gtg gat ccc gtc 1968 
Gly Lys Gly Pro Asn Val Thr Val Ehr Val He .Trp Val Asp Pro Val 
645 650 655 

aat gtc ate gca gec acc tac gac ate etc att gag tec act gee gaa 2016 
Asn Val He Ala Ala Thr Tyr Asp He Leu He Glu Ser Thr Ala Glu 
660 665 670 

ttc aca cac tac aag ccc cct ttg aac ttg ccc ctg agg cct ggg gtc 2064 
Phe Thr His Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val 
675 680 685 

tgg aca gtg aaa att etc cac cac tgg gtg cca gtt gca gag acc aaa 2112 
Trp Thr Val Lys He Leu His His Trp Val Pro Val Ala Glu Thr Lys 
690 695 700 

ttc etc gtt gcg cct ctg acc ttc teg aac agg cag ccc ate aaa cct 2160 
Phe Leu Val Ala Pro Leu Thr Phe Ser Asn Arg Gin Pro He Lys Pro 
705 710 715 720 

gag gag gca ctg aag ctg cac aat ggg ccc etc cgc aat gec tac atg 2208 
Glu Glu Ala Leu Lys Leu His Asn Gly Pro Leu Arg Asn Ala Tyr Met 
725 730 735 

gag cag age ttc cag age eta aac ccc gtc etc age ctg ccc ate aac 2256 
Glu Gin Ser Phe Gin Ser Leu Asn Pro Val Leu Ser Leu Pro He Asn 
740 745 750 

ccc gee cag gtg gaa cag gca egg agg aac gca gee tec acg ggc aca 2304 
Pro Ala Gin Val Glu Gin Ala Arg Arg Asn Ala Ala Ser Thr Gly Thr 
755 760 765 

gcg ctg gag gga tgg ctg gac teg ttg gtg ggc ggg atg tgg act gee 2352 
Ala Leu Glu Gly Trp Leu Asp Ser Leu Val Gly Gly Met Trp Thr Ala 
770 775 780 

atg gac ate tgt gec acg ggc ccc aca gee tgc ccg gtc atg cag acc 2400 
Met Asp He Cys Ala Thr Gly Pro Thr Ala Cys Pro Val Met Gin Thr 
785 790 795 800 

tgc age cag acg gee tgg age tec ttc age cct gac ccc aag teg gag 2448 
Cys Ser Gin Thr Ala Trp Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu 
805 810 815 



ctg ggt gee gtc aaa cct gat ggc egg etc agg tag cactgggcac 
Leu Gly Ala Val Lys Pro Asp Gly Arg Leu Arg 
820 825 
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gaggagtggg ccacagcagg atctcaacgg gaaagcagcc agaggggttg tggggcctga 2554 
accccggcct ccaccctggg ggaggccctc tgtgaatggg tctctcctgg ccatagaatg 2614 
atggaaaggg aaggtcagca ggtcaaagca ggatcagcca acaacctgcc tttggcaagc 2674 
tgcgggtggg atggctcagt ccctgcactg tgactgtctc acctcttctg sttgatcctc 2734 
aagtcctaca ggttccttgt cttccccttc cagtgaccca cccctgaccc cagacgtgtg 2794 
attttcagac ttttctttcg agcagcagaa cttcgtttac ggagcacagt cataagtgra 2854 
ggttcagggt gctgacgaaa tccaagctgc tctggttgaa gctgacaagt gcgaggttcc 2914 
ctcccaaagc tcagccctct gggcggtccc cttgcccagg gtatctccta cggtacctct 2974 
tcagaaccca agggctctgc aaatgccagt ttgacaagca ctgcccagac caaccatggg 3034 
ttcaaactcc agccctgccc tttggttcat ttttctgctt ctcttggctg ggggactctg 3094 
gtgccagcct tgaaagtcat ggtcgtgggc cctttcccat ggaggctgca gccttaggag 3154 
agctctgagc ctctcagcag ccctccttgg gttgaactat tctccttagt aactaggtaa 3214 
gtgggaaagc cttttgatgt ggcatggcca aggtccagcc acaagtgcaa ctgccacctg 3274 
tccaggggtc tgggcctcct tccctcaagg ctgccacaca aagtagcaga aataggatga 3334 
tgtttgtgag caccagactc aagaccatga ccttctttga tccttgaaaa tgggaacttt 3394 
gacagccatg accatgaaac tcacaaggca acgcggatga aactcacaaa gcaatgcttg 3454 
gagcaaaact cctgagctag acagcacagc agcacccatc ccctgccaga gcccttccgt 3514 
ttgaggtcag acacacaaaa ccttcgtcaa -ttgcacaccg gtgctgttgg gagtgaccaa 3574 
accacatgaa ccagactttt cccgtccagg aaatagcatt tcagatttgg tttttaattt 3634 
catgcccttc ggccacaggc tcaacgggac atgcaacata aaaatgggaa ggttattcaa 3694 
cccgcgggaa aaaaaaaaaa aaaaaaaaaa aa 3726 



<210> 2 
<211> 827 
<212> PRT 

<213> Homo sapiens 
<4O0> 2 

Thr Gin Asp Gly Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr 
15 10 15 

Asp Ser Asn Asn Glu Asn Ser Val Pro Lys Asp Phe Glu Asn Val Asp 
20 25 30 

Asn Ser Asn Phe Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu 
35 40 45 

Leu Ala Lys Lys Pro Pro Ser Arg Gin Lys Glu Leu Leu Lys Arg Lys 
50 55 60 

Leu Glu Gin Gin Glu Lys Gly Lys Gly His Thr Phe Pro Gly Lys Gly 
65 70 75 80 

Pro Gly Glu Val Leu Pro Pro Gly Asp Arg Ala Ala Ala Asn Ser Ser 
85 90 95 

His Gly Lys Asp Val Ser Arg Pro Pro His Ala Arg Lys Thr Gly Gly 
100 105 110 

Ser Ser Pro Glu Thr Lys Tyr Asp Gin Pro Pro Lys Cys Asp lie Ser 
115 120 125 

Gly Lys Glu Ala lie Ser Ala Leu Ser Arg Ala Lys Ser Lys His Cys 
130 135 140 

Arg Gin Glu lie Gly Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu 
145 150 155 160 

Met Pro Glu Lys Val Thr Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn 
165 170 175 
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Lys Asn Val Gin Trp Asp Glu Asp Ser Val Glu Tyr Met Pro Ala Asn 
180 185 190 

Pro Val Arg He Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg 
195 200 205 

Gin Leu Gin Arg Met Phe Lys Ala He Tyr His Lys Asp His Phe Tyr 
210 215 220 

Tyr He His Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu 
225 230 235 240 

Gin Val Ser Arg Gin Tyr Ser Asn Val Arg Val Thr Pro Trp Arg Met 
245 250 255 

Ala Thr He Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser 



Met Arg Asp Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe He 
275 280 285 

Asn Leu Ser Ala Ala Asp Tyr Pro He Arg Thr Asn Asp Gin Leu Val 
290 295 300 

Ala Phe Leu Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly 
305 310 315 320 

Arg Asp Asn Ala Arg Phe He Arg Lys Gin Gly Leu Asp Arg Leu Phe 
325 330 335 

Leu Glu Cys Asp Ala His Met Trp Arg Leu Gly Asp Arg Arg lie Pro 
340 345 350 

Glu Gly He Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg 
355 360 365 

Arg Phe Val Glu Tyr Val Thr Phe Ser Thr Asp Asp Leu Val Thr Lys 
370 375 380 

Met Lys Gin Phe Tyr Ser Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe 
385 390 395 400 

His Thr Val Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn 
405 410 415 

Asn Leu Arg He Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin 
420 425 430 

Tyr Lys His He Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys 
435 440 445 

Pro Gin Asp Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe 
450 455 460 

Ala Arg Lys Phe Glu Ala Val Val Asn Gin Glu He He Gly Gin Leu 
465 470 475 480 

Asp Tyr Tyr Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg 



260 



265 



270 



4 



485 



4 90 



495 



Ser Tyr Trp Glu Asn Val Tyr Asp Glu Pro Asp Gly He His Ser Leu 
500 505 510 
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Ser Asp Val Thr Leu Thr Leu Tyr His Ser Phe Ala Arg Leu Gly Leu 
515 520 525 

Arg Arg Ala Glu Thr Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg 
530 535 540 

Tyr Tyr Pro Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala 
545 550 555 560 

Asp Arg Phe Gin Gly Phe Leu He Lys His His Ala Thr Asn Leu Ala 
565 570 575 

Val Ser Lys Leu Glu Thr Leu Glu Thr Trp Val Met Pro Lys Lys Val 
580 585 590 

Phe Lys He Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser 
595 600 605 

Glu Val Gly Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe 
610 615 620 

Gly Gly Leu Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp 
625 630 635 640 

Gly Lys Gly Pro Asn Val Thr Val Thr Val He Trp Val Asp Pro Val 
645 650 655 

Asn Val He Ala Ala Thr Tyr Asp He Leu He Glu Ser Thr Ala Glu 
660 665 670 

Phe Thr His Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val 
675 680 685 

Trp Thr Val Lys He Leu His His Trp Val Pro Val Ala Glu Thr Lys 
690 695 700 

Phe Leu Val Ala Pro Leu Thr Phe Ser Asn Arg Gin Pro He Lys Pro 
705 710 715 720 

Glu Glu Ala Leu Lys Leu His Asn Gly Pro Leu Arg Asn Ala Tyr Met 
725 730 735 

Glu Gin Ser Phe Gin Ser Leu Asn Pro Val Leu Ser Leu Pro He Asn 
740 745 750 

Pro Ala Gin Val Glu Gin Ala Arg Arg Asn Ala Ala Ser Thr Gly Thr 
755 760 765 

Ala Leu Glu Gly Trp Leu Asp Ser Leu Val Gly Gly Met Trp Thr Ala 
770 775 780 

Met Asp He Cys Ala Thr Gly Pro Thr Ala Cys Pro Val Met Gin Thr 
785 790 795 800 

Cys Ser Gin Thr Ala Trp Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu 
805 810 815 



Leu Gly Ala Val Lys Pro Asp Gly Arg Leu Arg 
820 825 
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<210> 3 

<211> 3608 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (150) . . (2744) 
<223> human XT-II 

<220> 

<221> misc_feature 
<222> (513) . . (515) 
<223> N-glycosylation site 

<220> 

<221> mis cofeature 
<222> (1128) . . (1130) 
<223> N-glcosylation site 

<220> 

<221> misc_feature 
<222> (2196) . . (2198) 
<223> N-glycosylation site 

<400> 3 

cgcaggcccc gcccccggcc ccgccccggc gcgcgcccgg cccgcctccc taggcgtgga 60 
ggaggggggg cggctcagcc ccgcgcccgt gcgcgcgctc gcggccgggt tgcagggctg 120 
ggcgcgcgcc ccgcgtcccg ggcaggaag atg gtg gcg age gcg cga gtg cag 173 

Met Val Ala Ser Ala Arg Val Gin 
1 5 

aag ctg gtg egg cgc tac aag ctg gcg att gec acg gcg ctg gec ate 221 
Lys Leu Val Arg Arg Tyr Lys Leu Ala lie Ma Thr Ala Leu Ala lie 
10 15 20 

ctg ctg ctg cag ggc ctg gta gtg tgg age ttc age ggc ctg gag gag 269 
Leu Leu Leu Gin Gly Leu Val Val Trp Ser Phe Ser Gly Leu Glu Glu 
25 30 35 40 

gac gag gcg ggc gag aaa gga agg cag agg aag cca egg cca ctg gac 317 
Asp Glu Ala Gly Glu Lys Gly Arg Gin Arg Lys Pro Arg Pro Leu Asp 
45 50 55 

cct ggc gaa ggt tec aag gac aca gac agt tea gca ggg cga egg ggc 365 
Pro Gly Glu Gly Ser Lys Asp Thr Asp Ser Ser Ala Gly Arg Arg Gly 
60 65 70 

age aca ggc aga agg cat ggg cgc tgg egg ggc cgt get gag age cca 413 
Ser Thr Gly Arg Arg His Gly Arg Trp Arg Gly Arg Ala Glu Ser Pro 
75 80 85 

gga gtg ccc gtg gec aag gtg gta egg gca gta ace age egg cag aga 4 61 
Gly Val Pro Val Ala Lys Val Val Arg Ala Val Thr Ser Arg Gin Arg 
90 95 100 

gee age egg egg gtc cca cct gec cca cct ccg gaa gec cca ggc cgc 509 
Ala Ser Arg Arg Val Pro Pro Ala Pro Pro Pro Glu Ala Pro Gly Arg 
105 110 115 120 



cag aac ctg agt ggg gca gca get ggg gag gcg ctg gta ggg gca get 
Gin Asn Leu Ser Gly Ala Ala Ala Gly Glu Ala Leu Val Gly Ala Ala 
125 130 135 



557 
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ggc ttc cca cca cac gga gat aca ggg age gtg gag ggc gec ccc cag 605 
Gly Phe Pro Pro His Gly Asp Thr Gly Ser Val Glu Gly Ala Pro Gin 
140 145 150 

ccc acg gac aat ggc ttc acc ccc aag tgc gag ate gtg ggc aag gac 653 
Pro Thr Asp Asn Gly Phe Thr Pro Lys Cys Glu lie Val Gly Lys Asp 
155 160 165 

gca ctg tct gca ctg gec egg gee age acc aag cag tgc cag cag gag 701 
Ala Leu Ser Ala Leu Ala Arg Ala Ser Thr Lys Gin Cys Gin Gin Glu 
170 175 180 

ate gec aat gtg gtg tgc ctg cac cag get ggg age etc atg ccc aag 749 
lie Ala Asn Val Val Cys Leu His Gin Ala Gly Ser Leu Met Pro Lys 
185 190 195 200 

get gtg ccc egg cac tgt cag ctg act ggg aag atg age ccc ggc ate 797 
Ala Val Pro Arg His Cys Gin Leu Thr Gly Lys -Met Ser Pro Gly lie 
205 210 215 

cag tgg gat gag age caa gee cag cag ccc atg gat ggc ccc ccg gtg 845 
Gin Trp Asp Glu Ser Gin Ala Gin Gin Pro Met Asp Gly Pro Pro Val 
220 225 230 

cga ate gee tac atg ctg gtg gtt cac ggc cgc gec ate cgc cag ctg 893 
Arg He Ala Tyr Met Leu Val Val His Gly Arg Ala He Arg Gin Leu 
235 240 245 

aag cgt etc etc aag gec gtt tat cac gag cag cac ttc ttt tac ate 941 
Lys Arg Leu Leu Lys Ala Val Tyr His Glu Gin His Phe Phe Tyr He 
250 255 260 

cat gtg gac aag cgt tec gac tac ctg cac egg gag gtg gtg gag ctg 989 
His Val Asp Lys Arg Ser Asp Tyr Leu His Arg Glu Val Val Glu Leu 
265 270 275 2B0 

gec cag ggc tat gat aac gtg egg gtg acg ccc tgg cgc atg gtt acc 1037 
Ala Gin Gly Tyr Asp Asn Val Arg Val Thr Pro Trp Arg Met Val Thr 
285 290 295 

ate tgg ggc ggg gec age etc ctg acg atg tac ctg egg age atg egg 1085 
He Trp Gly Gly Ala Ser Leu Leu Thr Met Tyr Leu Arg Ser Met Arg 
300 305 310 

gac ctg eta gag gtg cct ggc tgg gee tgg gac ttc ttc ate aac etc 1133 
Asp Leu Leu Glu Val Pro Gly Trp Ala Trp Asp Phe Phe He Asn Leu 
315 320 325 

agt gec act gac tat cca acc agg acc aat gag gag ctg gtg gca ttc 1181 
Ser Ala Thr Asp Tyr Pro Thr Arg Thr Asn Glu Glu Leu Val Ala Phe 
330 335 340 

eta tec aag aac egg gac aag aat ttc etc aag tea cat ggc egg gac 1229 
Leu Ser Lys Asn Arg Asp Lys Asn Phe Leu Lys Ser His Gly. Arg Asp 
345 350 355 360 

aac tec agg ttc ate aag aaa cag ggc ctg gac egg etc ttc cat gag 1277 
Asn Ser Arg Phe He Lys Lys Gin Gly Leu Asp Arg Leu Phe His Glu 
365 370 375 

tgc gac tea cac atg tgg cgc ctg ggc gag egg cag ate cca gca ggc 1325 
Cys Asp Ser His Met Trp Arg Leu Gly Glu Arg Gin He Pro Ala Gly 
380 385 390 
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att gtg gtg gat ggc ggt tct gac tgg ttc gtg ctg aca cgc age ttt 1373 

lie Val Val Asp Gly Gly Ser Asp Trp Phe Val Leu Thr Arg Ser Phe 
395 400 405 

gtg gag tat gtg gtg tac aca gat gac ccg ctt gtg gec cag ctg cgc 1421 
Val Glu Tyr Val Val Tyr Thr Asp Asp Pro Leu Val Ala Gin Leu Arg 
410 415 420 

cag ttc tac aca tac aca ctg etc cca gec gag tec ttc ttc cac acg 14 69 
Gin Phe Tyr Thr Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe His Thr 
425 430 435 440 

gtg ctg gag aac age ctg gec tgt gag ace etc gtg gac aac aac ctg 1517 
Val Leu Glu Asn Ser Leu Ala Cys Glu Thr Leu Val Asp Asn Asn Leu 
445 450 455 

egg gtc ace aac tgg aac cgc aag ctg ggc tgc aag tgc cag tac aag 1565 
Arg Val Thr Asn Trp Asn Arg Lys l?eu Gly Cys -Lys Cys Gin Tyr Lys 
460 465 470 

cac att gtg gac tgg tgt ggc tgc tec ccc aac gac ttc aag cca cag 1613 
His lie Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys Pro Gin 
475 480 485 

gac ttc etc egg ctg cag caa gtc tec aga ccc ace ttc ttc gee egg 1661 
Asp Phe Leu Arg Leu Gin Gin Val Ser Arg Pro Thr Phe Phe Ala Arg 
490 495 500 

aag ttc gag teg act gtg aac cag gag gtg ctg gaa ate ctg gac ttc 1709 
Lys Phe Glu Ser Thr Val Asn Gin Glu Val Leu Glu He Leu Asp Phe 
505 510 515 520 

cac ctg tac ggc age tac ccc ccc ggc acg cca gec etc aag gec tac 1757 
His Leu Tyr Gly Ser Tyr Pro Pro Gly Thr Pro Ala Leu Lys Ala Tyr 
525 530 535 

tgg gag aac acc tac gac gcg get gat ggc ccc agt ggg etc agt gat 1805 
Trp Glu Asn Thr Tyr Asp Ala Ala Asp Gly Pro Ser Gly Leu Ser Asp 
540 545 550 

gtc atg etc act get tac aca gee ttc gee cgc etc age ctg cac cat 1853 
Val Met Leu Thr Ala Tyr Thr Ala Phe Ala Arg Leu Ser Leu His His 
555 560 565 

gec gee act get gca ccc cca atg ggc acc cca etc tgc agg ttt gag 1901 
Ala Ala Thr Ala Ala Pro Pro Met Gly Thr Pro Leu Cys Arg Phe Glu 
570 575 580 

ccc agg ggc ttg ccg tec age gtg cac ctg tat ttc tat gac gac cat 194 9 
Pro Arg Gly Leu Pro Ser Ser Val His Leu Tyr Phe Tyr Asp Asp His 
585 590 595 600 

ttc cag ggc tac ctg gtg acg cag gcg gtg cag ccc tea gee cag ggg 1997 
Phe Gin Gly Tyr Leu Val Thr Gin Ala Val Gin Pro Ser Ala Gin Gly 
605 610 615 

ccg gca gag acg ctt gag atg tgg ctg atg ccc caa ggg teg ctg aag 2045 
Pro Ala Glu Thr Leu Glu Met Trp Leu Met Pro Gin Gly Ser Leu Lys 
620 625 630 

ctg ttg ggg cgc agt gac cag gec age egg etc cag agt ctg gag gtt 2093 
Leu Leu Gly Arg Ser Asp Gin Ala Ser Arg Leu Gin Ser Leu Glu Val 
635 640 645 
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ggc act gat tgg gac ccc aaa gag cgt ctt ttc egg aac ttt ggg ggg 
Gly Thr Asp Trp Asp Pro Lys Glu Arg Leu Phe Arg Asn Phe Gly Gly 
650 655 660 



2141 



tta ctg ggg ccg ctg gac gag cct gtg gec gtg cag cgc tgg gec egg 2189 
Leu Leu Gly Pro Leu Asp Glu Pro Val Ala Val Gin Arg Trp Ala Arg 
665 670 675 680 



ggc ccc aac etc aca gec aca gtg gtc tgg ate gac cca ace tat gtg 
Gly Pro Asn Leu Thr Ala Thr Val Val Trp lie Asp Pro Thr Tyr Val 
685 690 695 



2237 



gtg gee aca tct tat gac ate aca gta gat acg gag act gag gtc acg 2285 
Val Ala Thr Ser Tyr Asp He Thr Val Asp Thr Glu Thr Glu Val Thr 
700 705 710 



caa tac aag ccc cca ctg age egg ccc ctg egg cca ggg ccc tgg act 
Gin Tyr Lys Pro Pro Leu Ser Arg Pro Leu Arg -Pro Gly Pro Trp Thr 
715 720 725 



2333 



gtt cga etc ctt cag ttc tgg gaa ccg ctg ggt gag acc cgc ttc ctt 2381 
Val Arg Leu Leu Gin Phe Trp Glu Pro Leu Gly Glu Thr Arg Phe Leu 
730 735 740 

gtg ctg ccc ttg acc ttc aac cgc aaa eta cct etc agg aaa gat gat 2429 
Val Leu Pro Leu Thr Phe Asn Arg Lys Leu Pro Leu Arg Lys Asp Asp 
745 750 755 760 



gec age tgg ctg cac gca ggg cca ccc cac aac gag tac atg gag cag 
Ala Ser Trp Leu His Ala Gly Pro Pro His Asn Glu Tyr Met Glu Gin 
765 770 775 



2477 



agt ttc cag ggc ctg agt age ate ctg aac ctg cct cag ccg gag etc 
Ser Phe Gin Gly Leu Ser Ser He Leu Asn Leu Pro Gin Pro Glu Leu 
780 785 790 



2525 



gcg gag gag get gec cag egg cac aca cag etc aca ggc cct gcg etc 
Ala Glu Glu Ala Ala Gin Arg His Thr Gin Leu Thr Gly Pro Ala Leu 
795 800 805 



2573 



gag gec tgg aca gac agg gaa ctg age age ttc tgg tec gtg get gga 
Glu Ala Trp Thr Asp Arg Glu Leu Ser Ser Phe Trp Ser Val Ala Gly 
810 815 820 



26Z1 



ctg tgt gec ata ggc ccc tct ccc tgc ccc tec ctg gag ccc tgc aga 
Leu Cys Ala He Gly Pro Ser Pro Cys Pro Ser Leu Glu Pro Cys Arg 
825 830 835 840 



2669 



ctg acc age tgg age tct ctg tec ccc gac ccc aaa tea gag ctg ggg 2717 
Leu Thr Ser Trp Ser Ser Leu Ser Pro Asp Pro Lys Ser Glu Leu Gly 
845 850 855 

cct gtc aaa gca gac ggg cga etc agg tagcagggee ccagccagta 27 64 

Pro Val Lys Ala Asp Gly Arg Leu Arg 
860 865 



cccgtggagg 
gcaagaacca 
cagggaaggt 
gegggaaege 
ctagtttgaa 
gggtggttgg 
aagtgggcca 



accegggaaa 
gaggeccagg 
ggacacagta 
gagaagggca 
tttctttttt 
gagagaaact 
tgtctgtggg 



ttgeacctta 
ctgcacaccc 
tgaactactg 
ccagcagcat 
ttctttcttt 
agaaeagaag 
gagcagtggc 



cagacagtgg 
atttcageca 
ctgatgtctc 
tccacaccca 
tttttttttt 
atgtgcaata 
ttgetgagee 



aggggtgtcc 
tcaagaaccc 
tgttggggat 
gctcttcctc 
tttaatttaa 
gggctcagag 
cacctgctat 



cctcccacag 2824 
acacagaegg 2884 
cagagggctg 2944 
accttcctgt 3004 
aaaggaaaat 3064 
cagccccagg 3124 
tgttctgcac 3184 
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gaggctgggc ctgcctcact ctccagggcc 
tggctgaggc tccacagggc tgcaagtgcc 
gatatcgggg gcgctgcaga gctgggctct 
cctagaagtg gggtggggcg gctccagggc 
aggcagggtc agggcccact gagacccatg 
gtcacatgag cagcgtggga agaagactct 
tgtgcctgaa gctcagcgtg aagtctgaaa 
tcta 
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tcatgcccca ttctgggcct gtggtgctcg 3244 

ctgccaggct ctaaggcccg aagaacaggt 3304 

agcaggacag ttcttttgta gccaggggac 3364 

tttccagcat accctgcccc tggatgggaa 3424 

cagagtttcc aggtgtgcac tggaagtgag 3484 

gtcaagactc tcagaagaac ctggagtaat 354 4 

tatgcaacag aagaaatata tctctatctc 3604 

3608 



<210> 4 
<211> 865 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Val Ala Ser Ala Arg Val Gin Lys Leu Val Arg Arg Tyr Lys Leu 
1 5 - 10 15 

Ala lie Ala Thr Ala Leu Ala lie Leu Leu Leu Gin Gly Leu Val Val 
20 25 30 

Trp Ser Phe Ser Gly Leu Glu Glu Asp Glu Ala Gly Glu Lys Gly Arg 
35 40 45 

Gin Arg Lys Pro Arg Pro Leu Asp Pro Gly Glu Gly Ser Lys Asp Thr 
50 55 60 

Asp Ser Ser Ala Gly Arg Arg Gly Ser Thr Gly Arg Arg His Gly Arg 
65 70 75 80 

Trp Arg Gly Arg Ala Glu Ser Pro Gly Val Pro Val Ala Lys Val Val 
85 90 95 

Arg Ala Val Thr Ser Arg Gin Arg Ala Ser Arg Arg Val Pro Pro Ala 
100 105 110 

Pro Pro Pro Glu Ala Pro Gly Arg Gin Asn Leu Ser Gly Ala Ala Ala 
115 120 125 

Gly Glu Ala Leu Val Gly Ala Ala Gly Phe Pro Pro His Gly Asp Thr 
130 135 140 

Gly Ser Val Glu Gly Ala Pro Gin Pro Thr Asp Asn Gly Phe Thr Pro 
145 150 155 160 

Lys Cys Glu He Val Gly Lys Asp Ala Leu Ser Ala Leu Ala Arg Ala 
165 170 175 

Ser Thr Lys Gin Cys Gin Gin Glu He Ala Asn Val Val Cys Leu His 
180 185 190 

Gin Ala Gly Ser Leu Met Pro Lys Ala Val Pro Arg His Cys Gin Leu 
195 200 205 

Thr Gly Lys Met Ser Pro Gly He Gin Trp Asp Glu Ser Gin Ala Gin 
210 215 220 

Gin Pro Met Asp Gly Pro Pro Val Arg He Ala Tyr Met Leu Val Val 
225 230 235 240 

His Gly Arg Ala He Arg Gin Leu Lys Arg Leu Leu Lys Ala Val Tyr 
245 250 255 
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His Glu Gin His Phe Phe Tyr lie His Val Asp Lys Arg Ser Asp Tyr 

260 265 270 

Leu His Arg Glu Val Val Glu Leu Ala Gin Gly Tyr Asp Asn Val Arg 
275 280 285 

Val Thr Pro Trp Arg Met Val Thr He Trp Gly Gly Ala Ser Leu Leu 
290 295 300 

Thr Met Tyr Leu Arg Ser Met Arg Asp Leu Leu Glu Val Pro Gly Trp 
305 310 315 320 

Ala Trp Asp Phe Phe He Asn Leu Ser Ala Thr Asp Tyr Pro Thr Arg 
325 330 335 

Thr Asn Glu Glu Leu Val Ala Phe Leu Ser Lys Asn Arg Asp Lys Asn 
340 345 350 

Phe Leu Lys Ser His Gly Arg Asp Asn Ser Arg Phe He Lys Lys Gin 
355 360 365 

Gly Leu Asp Arg Leu Phe His Glu Cys Asp Ser His Met Trp Arg Leu 
370 375 380 

Gly Glu Arg Gin He Pro Ala Gly He Val Val Asp Gly Gly Ser Asp 
385 390 395 400 

Trp Phe Val Leu Thr Arg Ser Phe Val Glu Tyr Val Val Tyr Thr Asp 
405 • 410 415 

Asp Pro Leu Val Ala Gin Leu Arg Gin Phe Tyr Thr Tyr Thr Leu Leu 
420 425 430 

Pro Ala Glu Ser Phe Phe His Thr Val Leu Glu Asn Ser Leu Ala Cys 
435 440 445 

Glu Thr Leu Val Asp Asn Asn Leu Arg Val Thr Asn Trp Asn Arg Lys 
450 455 460 

Leu Gly Cys Lys Cys Gin Tyr Lys His He Val Asp Trp Cys Gly Cys 
465 470 475 480 

Ser Pro Asn Asp Phe Lys Pro Gin Asp Phe Leu Arg Leu Gin Gin Val 
485 490 495 

Ser Arg Pro Thr Phe Phe Ala Arg Lys Phe Glu Ser Thr Val Asn Gin 
500 505 510 

Glu Val Leu Glu He Leu Asp Phe His Leu Tyr Gly Ser Tyr Pro Pro 
515 520 525 

Gly Thr Pro Ala Leu Lys Ala Tyr Trp Glu Asn Thr Tyr Asp Ala Ala 
530 535 540 

Asp Gly Pro Ser Gly Leu Ser Asp Val Met Leu Thr Ala Tyr Thr Ala 
545 550 555 560 

Phe Ala Arg Leu Ser Leu His His Ala Ala Thr Ala Ala Pro Pro Met 
565 570 575 



Gly Thr Pro Leu Cys Arg Phe Glu Pro Arg Gly Leu Pro Ser Ser Val 
580 585 590 



WO 01/49831 PCT/EPOO/13311 

14 / 32 

His Leu Tyr Phe Tyr Asp Asp His Phe Gin Gly Tyr Leu Val Thr Gin 

595 600 605 

Ala Val Gin Pro Ser Ala Gin Gly Pro Ala Glu Thr Leu Glu Met Trp 
610 615 620 

Leu Met Pro Gin Gly Ser Leu Lys Leu Leu Gly Arg Ser Asp Gin Ala 
625 630 635 640 

Ser Arg Leu Gin Ser Leu Glu Val Gly Thr Asp Trp Asp Pro Lys Glu 
645 650 655 

Arg Leu Phe Arg Asn Phe Gly Gly Leu Leu Gly Pro Leu Asp Glu Pro 
660 665 670 

Val Ala Val Gin Arg Trp Ala Arg Gly Pro Asn Leu Thr Ala Thr Val 
675 680 685 

Val Trp He Asp Pro Thr Tyr Val Val Ala Thr Ser Tyr Asp He Thr 
690 695 700 

Val Asp Thr Glu Thr Glu Val Thr Gin Tyr Lys Pro Pro Leu Ser Arg 
705 710 715 720 

Pro Leu Arg Pro Gly Pro Trp Thr Val Arg Leu Leu Gin Phe Trp Glu 
725 730 735 

Pro Leu Gly Glu Thr Arg Phe Leu Val Leu Pro Leu Thr Phe Asn Arg 
740 745 750 

Lys Leu Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His Ala Gly Pro 
755 760 765 

Pro His Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser Ser He 
770 775 780 

Leu Asn Leu Pro Gin Pro Glu Leu Ala Glu Glu Ala Ala Gin Arg His 
785 790 795 800 

Thr Gin Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Arg Glu Leu 
805 810 815 

Ser Ser Phe Trp Ser Val Ala Gly Leu Cys Ala He Gly Pro Ser Pro 
820 825 830 

Cys Pro Ser Leu Glu Pro Cys. Arg Leu Thr Ser Trp Ser Ser Leu Ser 
835 840 845 

Pro Asp Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu 
850 855 860 

Arg 
865 

<210> 5 
<211> 2463 
<212> DNA 

<213> Rat coronavirus 

<220> 

<221> CDS 

<222> (1) . . (2463) 

<223> rat XT- I 

<400> 5 
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tac ttc tct cac egg ccc aaa gag aaa gtt egg ace gac age aac aat 4 8 
Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr Asp Ser Asn Asn 
15 10 15 

gag aac tea gtc ccc aag gac ttt gag aat gtg gac aac age aac ttc 96 
Glu Asn Ser Val Pro Lys Asp Phe Glu Asn Val Asp Asn Ser Asn Phe 
20 25 30 

gca ccc agg act caa aag cag aag cac cag cca gag ttg gca aag aag 144 
Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu Leu Ala Lys Lys 
35 40 45 

ccc etc age agg cag aaa gag cgt ttg cag aga aag ctg ggc gee cag 192 
Pro Leu Ser Arg Gin Lys Glu Arg Leu Gin Arg Lys Leu Gly Ala Gin 
50 55 60 

gac aaa gga cag ggg cag tea gtc eta gga aaa ggc ccc aag gag gtg 240 
Asp Lys Gly Gin Gly Gin Ser Val Beu Gly Lys -Gly Pro Lys Glu Val 
65 ' 70 75 80 

ctg cct cct egg gaa aaa gec cca ggc aac agt age caa ggg aag gat 288 
Leu Pro Pro Arg Glu Lys Ala Pro Gly Asn Ser Ser Gin Gly Lys Asp 
85 90 95 

etc tea aga cac age cat age agg aag agt ggt ggc ggt ggg tec ccg 336 
Leu Ser Arg His Ser His Ser Arg Lys Ser Gly Gly Gly Gly Ser Pro 
100 105 110 

gaa act aag tct gac cag gtc ccc aag tgt gac ate tct ggc aag gag 384 
Glu Thr Lys Ser Asp Gin Val Pro Lys Cys Asp lie Ser Gly Lys Glu 
115 120 125 

gee ate tea gcg ctg acc cgc get aag tec aag cac tgt cgc cag gag 432 
Ala lie Ser Ala Leu Thr Arg Ala Lys Ser Lys His Cys Arg Gin Glu 
130 135 140 

att gca gaa. acc tac tgt cgc cac aag ctg ggg ctg ctg atg cca gag 480 
He Ala Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu Met Pro Glu 
145 150 155 160 

aag gtg get cga ttc tgt ccc ctg gaa ggc aaa gee aac aag aat gtc 528 
Lys Val Ala Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn Lys Asn Val 
165 170 175 

cag tgg gat gag gat get gtt gag tac atg ccc ccc aac cca gtc aga 57 6 
Gin Trp Asp Glu Asp Ala Val Glu Tyr Met Pro Pro Asn Pro Val Arg 
180 185 190 

ate gee ttt gtc ctg gtg gtc cat ggc cgt gec tct cga cag ctg cag 624 
He Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg Gin Leu Gin 
195 200 205 

cgc atg ttt aag gec ate tac cac aaa gac cat ttc tac tat ate cat 672 
Arg Met Phe Lys Ala He Tyr His Lys Asp His Phe Tyr Tyr He His 
210 215 220 

gtg gat aag cgt tec aat tac ctg cat egg caa gtg etc cag ttc tec 720 
Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu Gin Phe Ser 
225 230 235 240 



agg cag tac gac aac gtc cga gtc act tec tgg agg atg gee acc att 
Arg Gin Tyr Asp Asn Val Arg Val Thr Ser Trp Arg Met Ala Thr He 
245 250 255 



768 
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tgg ggc gga gcc age etc ctg tec act tac ctg cag age atg egg gat 816 
Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser Met Arg Asp 
260 265 270 

eta ctg gag atg act gac tgg ccc tgg gac ttc ttc ate aac ctg agt 864 
Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe lie Asn Leu Ser 
275 280 285 

gcg get gac tac ccc ate agg aca aat gac cag etc gta gca ttt ctt 912 
Ala Ala Asp Tyr Pro lie Arg Thr Asn Asp Gin Leu Val Ala Phe Leu 
290 295 300 

tec aga tac cga gat atg aat ttc ctg aag tea cat ggt egg gac aat 960 
Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly Arg Asp Asn 
305 310 315 320 

gca agg ttc ate egg aag cag gac ctg gac cgt etc ttc ctg gag tgt 1008 
Ala Arg Phe lie Arg Lys Gin Asp lieu Asp Arg Leu Phe Leu Glu Cys 
325 330 335 

gat aca cac atg tgg cgc ctg ggg gac egg cga ate cca gag ggc att 1056 
Asp Thr His Met Trp Arg Leu Gly Asp Arg Arg lie Pro Glu Gly lie 
340 345 350 

get gta gat ggc ggc tct gat tgg ttc ctg eta aac egg aag ttt gta 1104 
Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg Lys Phe Val 
355 360 365 

gag tat gtg gca ttc tec aca gat gac ctg gtg acc aag atg aag cag 1152 
Glu Tyr Val Ala Phe Ser Thr Asp Asp Leu Val Thr Lys Met Lys Gin 
370 375 380 

ttc tac tct tat acc etc etc cct get gag tec ttt ttc cac acg gtc 1200 
Phe Tyr Ser Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe His Thr Val 
385 390 395 400 

ctg gag aac age ccc cac tgt gac acc atg gtg gat aac aac ctg cgc 1248 
Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn Asn Leu Arg 
405 410 415 

ate acc aat tgg aac cgc aag ctg ggc tgc aag tgc cag tac aag cat 1296 
lie Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin Tyr Lys His 
420 425 430 

att gtg gac tgg tgt ggc tgc tct ccc aat gac ttc aag cct cag gac 1344 
He Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys Pro Gin Asp 
435 440 445 

ttc cat cgc ttc cag cag acg gcc egg ccc acc ttc ttt gcc cga aag 1392 
Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe Ala Arg Lys 
450 455 460 

ttc gaa gcc ata gtg aac cag gaa ate att ggg cag ttg gac tct tac 1440 
Phe Glu Ala He Val Asn Gin Glu He He Gly Gin Leu Asp Ser Tyr 
465 470 475 480 

ctg tat ggc aac tat cct gcg ggc acc ccg ggc etc cgc tct tac tgg 1488 
Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg Ser Tyr Trp 
485 490 495 

gag aat gtc tat gac gaa cca gat ggc ate cag age etc age gat gtg 1536 
Glu Asn Val Tyr Asp Glu Pro Asp Gly He Gin Ser Leu Ser Asp Val 
500 505 510 
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gca etc acc atg tat cat tec ttc ate cgc ctg ggt ctt cga agg get 1584 
Ala Leu Thr Met Tyr His Ser Phe He Arg Leu Gly Leu Arg Arg Ala 
515 520 525 

gag tea teg eta cac acg gat ggg gag aac age tgc agg tac tat cca 1632 
Glu Ser Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg Tyr Tyr Pro 
530 535 540 

atg ggc cac cca gcg tct gtc cac etc tac ttc ctg get gac cga ttc 1680 
Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala Asp Arg Phe 
545 550 555 560 

cag ggc ttt ctg ate aag cac cat gtg acc aac ctt get gtg age aaa 1728 
Gin Gly Phe Leu He Lys His His Val Thr Asn Leu Ala Val Ser Lys 
565 570 575 

ctg gag aca ctg gag aca tgg atg atg cca aag aaa gtc ttc aag gtc 177 6 
Leu Glu Thr Leu Glu Thr Trp Met Met Pro Lys -Lys Val Phe Lys Val 
580 585 590 

gca agt ccc ccc agt gac ttt ggg agg ctt cag ttt tct gag gtt ggc 1824 
Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser Glu Val Gly 
595 600 605 

act gac tgg gat gee aag gag aga eta ttc egg aac ttt ggt ggt ctt 1872 
Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe Gly Gly Leu 
610 615 620 

ttg ggg ccc atg gat gag ccg gtg gga atg cag aaa tgg ggg aag gga 1920 
Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp Gly Lys Gly 
625 630 635 640 

ccc aat gtg acc gtg act gtt att tgg gtg gat cct gtc aat gtc ate 1968 
Pro Asn Val Thr Val Thr Val lie Trp Val Asp Pro Val Asn Val He 
645 650 655 

gca gee acc tat gat ate ctg ate gag tec act gcg gaa ttc aca cac 2016 
Ala Ala Thr Tyr Asp He Leu He Glu Ser Thr Ala Glu Phe Thr His 
660 665 670 

tac aaa ccc cct ttg aat ctg cct ctg agg cct ggt gtc tgg aca gtg 2064 
Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val Trp Thr Val 
675 680 685 

aag att etc cat cac tgg gta cca gtt gca gag acc aaa ttc ctt gtc 2112 
Lys He Leu His His Trp Val Pro Val Ala Glu Thr Lys Phe Leu Val 
690 695 700 

gca cct ctg acc ttc tec aac aag cag ccc ate aaa cca gag gag get 2160 
Ala Pro Leu Thr Phe Ser Asn Lys Gin Pro He Lys Pro Glu Glu Ala 
705 710 715 720 

ttg aag ctg cac aat ggg cca cct cgc agt gee tac atg gag cag agt 2208 
Leu Lys Leu His Asn Gly Pro Pro Arg Ser Ala Tyr Met Glu Gin Ser 
725 730 735 

ttc cag age eta aac cca gtc etc age ctg cac ate aat cct gee caa 2256 
Phe Gin Ser Leu Asn Pro Val Leu Ser Leu His He Asn Pro Ala Gin 
740 745 750 

gtg gag cag gee egg aag aac gca gee ttc act ggg aca gca eta gaa 2304 
Val Glu Gin Ala Arg Lys Asn Ala Ala Phe Thr Gly Thr Ala Leu Glu 
755 760 765 
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gcc tgg ctg gtg ggt ggg act tgg act gcc atg gac gtc tgt gcc aca 2352 

Ala Trp Leu Val Gly Gly Thr Trp Thr Ala Met Asp Val Cys Ala Thr 

770 775 780 

ggc ccc act gcc tgc cca gtc atg caa acc tgc age caa aca gcc tgg 2400 
Gly Pro Thr Ala Cys Pro Val Met Gin Thr Cys Ser Gin Thr Ala Trp 
785 790 795 800 

agt tec ttc age cct gat ccc aag tea gag ctg ggt gca gtg aag cct 2448 
Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu Leu Gly Ala Val Lys Pro 
805 810 815 

gat ggt egg etc agg 24 63 

Asp Gly Arg Leu Arg 
820 



<210> 6 
<211> 821 
<212> PRT 

<213> Rat coronavirus 
<400> 6 

Tyr Phe Ser His Arg Pro Lys Glu Lys Val Arg Thr Asp Ser Asn Asn 
15 10 15 

Glu Asn Ser Val Pro Lys Asp Phe Glu Asn Val Asp Asn Ser Asn Phe 
20 25 30 

Ala Pro Arg Thr Gin Lys Gin Lys His Gin Pro Glu Leu Ala Lys Lys 
35 40 45 

Pro Leu Ser Arg Gin Lys Glu Arg Leu Gin Arg Lys Leu Gly Ala Gin 
50 55 60 

Asp Lys Gly Gin Gly Gin Ser Val Leu Gly Lys Gly Pro Lys Glu Val 
65 70 75 80 

Leu Pro Pro Arg Glu Lys Ala Pro Gly Asn Ser Ser Gin Gly Lys Asp 
85 90 95 

Leu Ser Arg Bis Ser His Ser Arg Lys Ser Gly Gly Gly Gly Ser Pro 
100 105 110 

Glu Thr Lys Ser Asp Gin Val Pro Lys Cys Asp He Ser Gly Lys Glu 
115 120 125 

Ala He Ser Ala Leu Thr Arg Ala Lys Ser Lys His Cys Arg Gin Glu 
130 135 140 

He Ala Glu Thr Tyr Cys Arg His Lys Leu Gly Leu Leu Met Pro Glu 
145 150 155 160 

Lys Val Ala Arg Phe Cys Pro Leu Glu Gly Lys Ala Asn Lys Asn Val 
165 170 175 

Gin Trp Asp Glu Asp Ala Val Glu Tyr Met Pro Pro Asn Pro Val Arg 
180 185 190 

He Ala Phe Val Leu Val Val His Gly Arg Ala Ser Arg Gin Leu Gin 
195 200 205 
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Arg Met Phe Lys Ala He Tyr His Lys Asp His Phe Tyr Tyr He His 
210 215 220 

Val Asp Lys Arg Ser Asn Tyr Leu His Arg Gin Val Leu Gin Phe Ser 
225 230 235 240 

Arg Gin Tyr Asp Asn Val Arg Val Thr Ser Trp Arg Met Ala Thr He 
■245 250 255 

Trp Gly Gly Ala Ser Leu Leu Ser Thr Tyr Leu Gin Ser Met Arg Asp 
260 265 270 

Leu Leu Glu Met Thr Asp Trp Pro Trp Asp Phe Phe He Asn Leu Ser 
275 280 285 

Ala Ala Asp Tyr Pro He Arg Thr Asn Asp Gin Leu Val Ala Phe Leu 
290 295 300 

Ser Arg Tyr Arg Asp Met Asn Phe Leu Lys Ser His Gly Arg Asp Asn 
305 310 315 320 

Ala Arg Phe He Arg Lys Gin Asp Leu Asp Arg Leu Phe Leu Glu Cys 
325 330 335 

Asp Thr His Met Trp Arg Leu Gly Asp Arg Arg He Pro Glu Gly He 
340 345 350 

Ala Val Asp Gly Gly Ser Asp Trp Phe Leu Leu Asn Arg Lys Phe Val 
355 360 365 

Glu Tyr Val Ala Phe Ser Thr Asp Asp Leu Val Thr Lys Met Lys Gin 
370 ■ 375 380 

Phe Tyr Ser Tyr Thr Leu Leu Pro Ala Glu Ser Phe Phe His Thr Val 
385 390 395 400 

Leu Glu Asn Ser Pro His Cys Asp Thr Met Val Asp Asn Asn Leu Arg 
405 410 415 

He Thr Asn Trp Asn Arg Lys Leu Gly Cys Lys Cys Gin Tyr Lys His 
420 425 430 

He Val Asp Trp Cys Gly Cys Ser Pro Asn Asp Phe Lys Pro Gin Asp 
435 440 445 

Phe His Arg Phe Gin Gin Thr Ala Arg Pro Thr Phe Phe Ala Arg Lys 
450 455 460 

Phe Glu Ala He Val Asn Gin Glu He He Gly Gin Leu Asp Ser Tyr 
465 470 475 480 

Leu Tyr Gly Asn Tyr Pro Ala Gly Thr Pro Gly Leu Arg Ser Tyr Trp 
485 490 495 

Glu Asn Val Tyr Asp Glu Pro Asp Gly He Gin Ser Leu Ser Asp Val 
500 505 510 

Ala Leu Thr Met Tyr His Ser Phe He Arg Leu Gly Leu Arg Arg Ala 
515 520 525 



Glu Ser Ser Leu His Thr Asp Gly Glu Asn Ser Cys Arg Tyr Tyr Pro 
530 535 540 
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Met Gly His Pro Ala Ser Val His Leu Tyr Phe Leu Ala Asp Arg Phe 
545 550 555 560 

Gin Gly Phe Leu lie Lys His His Val Thr Asn Leu Ala Val Ser Lys 
565 570 575 

Leu Glu Thr Leu Glu Thr Trp Met Met Pro Lys Lys Val Phe Lys Val 
580 585 590 

Ala Ser Pro Pro Ser Asp Phe Gly Arg Leu Gin Phe Ser Glu Val Gly 
595 600 605 

Thr Asp Trp Asp Ala Lys Glu Arg Leu Phe Arg Asn Phe Gly Gly Leu 
610 615 620 

Leu Gly Pro Met Asp Glu Pro Val Gly Met Gin Lys Trp Gly Lys Gly 
625 630 635 640 

Pro Asn Val Thr Val Thr Val lie Trp Val Asp Pro Val Asn Val He 
645 650 655 

Ala Ala Thr Tyr Asp He Leu He Glu Ser Thr Ala Glu Phe Thr His 
660 665 670 

Tyr Lys Pro Pro Leu Asn Leu Pro Leu Arg Pro Gly Val Trp Thr Val 
675 680 685 

Lys He Leu His His Trp Val Pro Val Ala Glu Thr Lys Phe Leu Val 
690 695 700 

Ala Pro Leu Thr Phe Ser Asn Lys Gin Pro He Lys Pro Glu Glu Ala 
705 710 715 720 

Leu Lys Leu His Asn Gly Pro Pro Arg Ser Ala Tyr Met Glu Gin Ser 
725 730 735 

Phe Gin Ser Leu Asn Pro Val Leu Ser Leu His He Asn Pro Ala Gin 
740 745 750 

Val Glu Gin Ala Arg Lys Asn Ala Ala Phe Thr Gly Thr Ala Leu Glu 
755 760 765 

Ala Trp Leu Val Gly Gly Thr Trp Thr Ala Met Asp Val Cys Ala Thr 
770 775 780 

Gly Pro Thr Ala Cys Pro Val Met Gin Thr Cys Ser Gin Thr Ala Trp 
785 790 795 800 

Ser Ser Phe Ser Pro Asp Pro Lys Ser Glu Leu Gly Ala Val Lys Pro 
805 810 815 

Asp Gly Arg Leu Arg 
820 



<210> 7 
<211> 2858 
<212> DNA 

<213> Rat coronavirus 

<220> 

<221> CDS 

<222> (56) . . (2650) 
<223> rat XT-2 
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<400> 7 

gcgctcgcgg ccgggttgca gggccggacg cgcgccccgc gtcccgggca ggaag atg 58 

Met 
1 

gtg gcg age gcg egg gtg cag aag ctg gtg egg cgc tac aag ctg gcg 106 
Val Ala Ser Ala Arg Val Gin Lys Leu Val Arg Arg Tyr Lys Leu Ala 
5 10 15 

ate gee ace gcg ctg gec ate ctg ctg ctg cag ggc ctg gtg gtg tgg 154 
lie Ala Thr Ala Leu Ala lie Leu Leu Leu Gin Gly Leu Val Val Trp 
20 25 30 

age ttc age ggc ctg gag gag gac gag ccg ggc gag aaa gga agg cag 202 
Ser Phe Ser Gly Leu Glu Glu Asp Glu Pro Gly Glu Lys Gly Arg Gin 
35 40 45 

aga aag cca egg ccg eta gac cct gga gag ggt tec aag gac acg gac 250 
Arg Lys Pro Arg Pro Leu Asp Pro Gly Glu Gly -Ser Lys Asp Thr Asp 
50 ' 55 60 65 

age tct gca ggg cgt agg ggc agt get ggg aga agg cac gga egg tgg 298 
Ser Ser Ala Gly Arg Arg Gly Ser Ala Gly Arg Arg His Gly Arg Trp 
70 75 80 

egg ggc cgc gec gag age cca ggt gtg cct gtg gec aag gtg gtg egg 34 6 
Arg Gly Arg Ala Glu Ser Pro Gly Val Pro Val Ala Lys Val Val Arg 
85 90 95 

gca gta ace agt egg cag aga gee age cga egg gtg ccc cct gec cca 394 
Ala Val Thr Ser Arg Gin Arg Ala Ser Arg Arg Val Pro Pro Ala Pro 
100 105 110 

cct cca gag gca cca ggc cgc cag aac ctg agt gga gca gca gec ggg 442 
Pro Pro Glu Ala Pro Gly Arg Gin Asn Leu Ser Gly Ala Ala Ala Gly 
115 120 125 

gaa gca ctg ata ggt gec gec ggc ttc cca caa cat gga gac aca ggg 4 90 
Glu Ala Leu lie Gly Ala Ala Gly Phe Pro Gin His Gly Asp Thr Gly 
130 135 140 145 

agt gtg gag ggt gee cct cag ccc acg gat aac age ttc act ccg aag 538 
Ser Val Glu Gly Ala Pro Gin Pro Thr Asp Asn Ser Phe Thr Pro Lys 
150 s 155 160 

tgt gag att gtg ggc aag gat gca ctg tea gca ctg gee egg gec age 58 6 
Cys Glu lie Val Gly Lys Asp Ala Leu Ser Ala Leu Ala Arg Ala Ser 
165 170 175 

ace aag cat tgt caa caa gag att get aat gta gtg tgc ctg cac caa 634 
Thr Lys His Cys Gin Gin Glu lie Ala Asn Val Val Cys Leu His Gin 
180 185 190 

get ggg aac eta atg ccc aag tct gtg ccc egg cac tgc cag ctg get 682 
Ala Gly Asn Leu Met Pro Lys Ser Val Pro Arg His Cys Gin Leu Ala 
195 200 205 

ggc aag gtg age cct ggc ate cag tgg gaa gag gtc egg gec cag cag 730 
Gly Lys Val Ser Pro Gly He Gin Trp Glu Glu Val Arg Ala Gin Gin 
210 215 220 225 

cct gtg agt ggc cct ctg gta cgc ate gec tac atg ctg gtg gtt cac 778 
Pro Val Ser Gly Pro Leu Val Arg He Ala Tyr Met Leu Val Val His 
230 235 240 



WO 01/49831 PCT/EP00/13311 

22 / 32 

ggc cgt get gtg cgc cag ctg aag cgt ctt ctg aag gec gtc tac cac 82 6 

Gly Arg Ala Val Arg Gin Leu Lys Arg Leu Leu Lys Ala Val Tyr His 
245 250 255 

gag gag cac ttc ttt tat att cat gtg gac aag cgt tec aac tac ctg 874 
Glu Glu His Phe Phe Tyr lie His Val Asp Lys Arg Ser Asn Tyr Leu 
260 265 270 

tac egg gag gtg gta gag ctg gec cag cac tac gac aat gta egg gta 922 
Tyr Arg Glu Val Val Glu Leu Ala Gin His Tyr Asp Asn Val Arg Val 
275 280 285 

aca cct tgg cgc atg gtc acc ate tgg ggt ggg get age ctt ctg agg 970 
Thr Pro Trp Arg Met Val Thr lie Trp Gly Gly Ala Ser Leu Leu Arg 
290 295 300 305 

atg tac ctg egg age atg aag gac ctt ctg gag act cct ggc tgg acc 1018 
Met Tyr Leu Arg Ser Met Lys Asp lieu Leu Glu Thr Pro Gly Trp Thr 
310 315 320 

tgg gac ttc ttc ate aac ctg age get act gac tat cca acc agg acg 1066 
Trp Asp Phe Phe lie Asn Leu Ser Ala Thr Asp Tyr Pro Thr Arg Thr 
325 330 335 

aat gag gag ctg gta gcg ttc tta tec aag aac egg gac aag aat ttc 1114 
Asn Glu Glu Leu Val Ala Phe Leu Ser Lys Asn Arg Asp Lys Asn Phe 
340 345 350 

etc aag tea cac ggg cga gac aat tec agg ttc ate aag aaa caa ggc 1162 
Leu Lys Ser His Gly Arg Asp Asn Ser Arg Phe lie Lys Lys Gin Gly 
355 360 365 

ctg gac egg ctt ttc cat gag tgt gat tec cac atg tgg cgc ctg ggt 1210 
Leu Asp Arg Leu Phe His Glu Cys Asp Ser His Met Trp Arg Leu Gly 
370 375 380 385 

gaa egg cag ate ccg gca ggc ate gtg gtg gat ggt ggc tct gac tgg 1258 
Glu Arg Gin He Pro Ala Gly He Val Val Asp Gly Gly Ser Asp Trp 
390 395 400 

ttc gtg ctg aca cgc age ttt gtg gaa tat gtg gtg tat aca gag gat 1306 
Phe Val Leu Thr Arg Ser Phe Val Glu Tyr -Val Val Tyr Thr Glu Asp 
405 410 415 

cct ctg gtg gee cag ctt cgc cag ttc tat aca tac aca ttg ctt cca 1354 
Pro Leu Val Ala Gin Leu Arg Gin Phe Tyr Thr Tyr Thr Leu Leu Pro 
420 425 . 430 

gec gag tec ttc ttc cac aca gtg ctg gag aac age cca gec tgt gag 1402 
Ala Glu Ser Phe Phe His Thr Val Leu Glu Asn Ser Pro Ala Cys Glu 
435 440 445 

age eta gtg gac aac aac ctg egg gtt acc aac tgg aac egg aag ctg 1450 
Ser Leu Val Asp Asn Asn Leu Arg Val Thr Asn Trp Asn Arg Lys Leu 
450 455 460 465 

ggc tgc aag tgc cag tac aag cac ate gtg gac tgg tgt ggc tgc tec 1498 
Gly Cys Lys Cys Gin Tyr Lys His He Val Asp Trp Cys Gly Cys Ser 
470 475 480 

ccc aac gac ttc aag cca cag gac ttc ctg egg ctt cag caa gtc tec 1546 
Pro Asn Asp Phe Lys Pro Gin Asp Phe Leu Arg Leu Gin Gin Val Ser 
485 490 495 
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aga ccc acc ttc ttt gcc egg aag ttt gag teg act gtg aac cag gaa 1594 

Arg Pro Thr Phe Phe Ala Arg Lys Phe Glu Ser Thr Val Asn Gin Glu 
500 505 510 

gtc ctg gaa att ttg gac ttc cac ctg tat ggc age tac cca ccc ggc 1642 
Val Leu Glu lie Leu Asp Phe His Leu Tyr Gly Ser Tyr Pro Pro Gly 
515 520 525 

acc cca gcc etc aag gcc tac tgg gag aac ate tac gac atg gcc gat 1690 
Thr Pro Ala Leu Lys Ala Tyr Trp Glu Asn He Tyr Asp Met Ala Asp 
530 535 540 545 

ggc cct agt gga etc age gat gtc eta etc act get tac aca gcc ttt 1738 
Gly Pro Ser Gly Leu Ser Asp Val Leu Leu Thr Ala Tyr Thr Ala Phe ' 
550 555 560 

gcc cgt ate agt ctg cgt cat get gcc act gtt tec cca ctg gcc act 1786 
Ala Arg lie Ser Leu Arg His Ala #la Thr Val Ser Pro Leu Ala Thr 
565 570 575 

gca gtc tgc agg ttt gag ccc agg ggg ttg ccg tec age gtg cac ctg 1834 
Ala Val Cys Arg Phe Glu Pro Arg Gly Leu Pro Ser Ser Val His Leu 
580 585 590 

tat ttc tat gac gac cat ttc cag ggc tac ctg gtg acg cag gca gtg 1882 
Tyr Phe Tyr Asp Asp His Phe Gin Gly Tyr Leu Val Thr Gin Ala Val 
595 600 605 

cag ccc tea gcc cag ggg cca gca gag aca ctt gag atg tgg ctg atg 1930 
Gin Pro Ser Ala Gin Gly Pro Ala Glu Thr Leu Glu Met Trp Leu Met 
610 615 620 625 

ccc cag agg ttg ctg aag ccg ttg ggg cac agt gac cag gcc age egg 1978 
Pro Gin Arg Leu Leu Lys Pro Leu Gly His Ser Asp Gin Ala Ser Arg 
630 635 640 

etc cag agt ctg gag gtt ggc act gag tgg gac ccc aaa gaa cgt etc 2026 
Leu Gin Ser Leu Glu Val Gly Thr Glu Trp Asp Pro Lys Glu Arg Leu 
645 650 655 

ttc egg aac ttt ggg ggc ctg ttg gga cca ctg gat gaa cct gtg gcc 2074 
Phe Arg Asn Phe Gly Gly Leu Leu Gly Pro Leu Asp Glu Pro Val Ala 
660 665 670 

atg cag cgc tgg gcc egg ggc ccc aac etc aca gcc act gtg gtc tgg 2122 
Met Gin Arg Trp Ala Arg Gly Pro Asn Leu Thr Ala Thr Val Val Trp 
675 680 685 

att gac ccc acc tat gtt gtg gcc aca tec tat gac ate acg gta gat 2170 
He Asp Pro Thr Tyr Val Val Ala Thr Ser Tyr Asp He Thr Val Asp 
690 695 700 705 

gcg gac act gaa gtc acg cag tac aag ccc cca ctg age ctg cca ctg 2218 
Ala Asp Thr Glu Val Thr Gin Tyr Lys Pro Pro Leu Ser Leu Pro Leu 
710 715 720 

egg cca gga gcc tgg act gtt cga ttg ctt cag ttc tgg gag ccc ctg 2266 
Arg Pro Gly Ala Trp Thr Val Arg Leu Leu Gin Phe Trp Glu Pro Leu 
725 730 735 

ggt gag acc cgc ttc etc gtg ctg cca ttg acc ttc aac cac aaa eta 2314 
Gly Glu Thr Arg Phe Leu Val Leu Pro Leu Thr Phe Asn His Lys Leu 
740 745 750 
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cct etc agg aaa gat gat gec age tgg ctg cat gcg gga cca ccc cac 2362 
Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His Ala Gly Pro Pro His 
755 760 765 

aac gaa tac atg gaa cag agt ttc cag gga eta agt ggc ate ctg aat 2410 
Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser Gly lie Leu Asn 
770 775 780 785 

ctg cct cag cca gag gec gtg gag gag get gee egg egg cac aca gag 24 58 
Leu Pro Gin Pro Glu Ala Val Glu Glu Ala Ala Arg Arg His Thr Glu 
790 795 800 

etc aca ggt cct gca ctt gag gee tgg aca gat ggg gaa ctg age agt 2506 
Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Gly Glu Leu Ser Ser 
805 810 815 

ttc tgg tct gtt gca gga ttg tgt gee ata ggg cct tct tct tgt ccc 2554 
Phe Trp Ser Val Ala Gly Leu Cys jtLa He Gly -Pro Ser Ser Cys Pro 
820 825 830 

tec ctg gag etc tgc aga ctg ace age tgg age tct ctg tct cct gac 2602 
Ser Leu Glu Leu Cys Arg Leu Thr Ser Trp Ser Ser Leu Ser Pro Asp 
835 840 845 

ccc aag tea gag ctg ggg cct gtc aaa get gac gga cga etc agg tag 2650 
Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu Arg 
850 855 860 865 

caggacccca gccagcacaa cceggaggag ceggggaatt gcaccttaca gacaatggag 2710 
ggaegtctet cctctggtag ccagggagcc ctagaacagt ggtggggctg tctgtctgtc 2770 
aggctttcct aggaaatgga cgacatcagg ccctgctgag atccatgaag tttccccact 2830 
ggaggaggga gccccactgg cagcacag 2858 



<210> 8 
<211> 864 
<212> PRT 

<213> Rat coronavirus 
<400> 8 

Met Val Ala Ser Ala Arg Val Gin Lys Leu Val Arg Arg Tyr Lys Leu 
1 5 10 15 

Ala He Ala Thr Ala Leu Ala He Leu Leu Leu Gin Gly Leu Val Val 
20 25 30 

Trp Ser Phe Ser Gly Leu Glu Glu Asp Glu Pro Gly Glu Lys Gly Arg 
35 40 45 

Gin Arg Lys Pro Arg Pro Leu Asp Pro Gly Glu Gly Ser Lys Asp Thr 
50 55 60 

Asp Ser Ser Ala Gly Arg Arg Gly Ser Ala Gly Arg Arg His Gly Arg 
65 70 75 80 

Trp Arg Gly Arg Ala Glu Ser Pro Gly Val Pro Val Ala Lys Val Val 
85 90 95 

Arg Ala Val Thr Ser Arg Gin Arg Ala Ser Arg Arg Val Pro Pro Ala 
100 105 110 

Pro Pro Pro Glu Ala Pro Gly Arg Gin Asn. Leu Ser Gly Ala Ala Ala 
115 120 125 
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Gly Glu Ala Leu lie Gly Ala Ala Gly Phe Pro Gin His Gly Asp Thr 

130 135 140 

Gly Ser Val Glu Gly Ala Pro Gin Pro Thr Asp Asn Ser Phe Thr Pro 
145 150 155 160 

Lys Cys Glu lie Val Gly Lys Asp Ala Leu Ser Ala Leu Ala Arg Ala 
165 170 175 

Ser Thr Lys His Cys Gin Gin Glu lie Ala Asn Val Val Cys Leu His 
180 185 190 

Gin Ala Gly Asn Leu Met Pro Lys Ser Val Pro Arg His Cys Gin Leu 
195 200 205 

Ala Gly Lys Val Ser Pro Gly lie Gin Trp Glu Glu Val Arg Ala Gin 
210 215 220 

Gin Pro Val Ser Gly Pro Leu Val Arg He Ala Tyr Met Leu Val Val 
225 230 235 240 

His Gly Arg Ala Val Arg Gin Leu Lys Arg Leu Leu Lys Ala Val Tyr 
245 250 255 

His Glu Glu His Phe Phe Tyr He His Val Asp Lys Arg Ser Asn Tyr 
260 265 270 

Leu Tyr Arg Glu Val Val Glu Leu Ala Gin His Tyr Asp Asn Val Arg 
275 280 285 

Val Thr Pro Trp Arg Met Val Thr He Trp Gly Gly Ala Ser Leu Leu 
290 295 300 

Arg Met Tyr Leu Arg Ser Met Lys Asp Leu Leu Glu Thr Pro Gly Trp 
305 310 315 320 

Thr Trp Asp Phe Phe He Asn Leu Ser Ala Thr Asp Tyr Pro Thr Arg 
325 330 335 

Thr Asn Glu Glu Leu Val Ala Phe Leu Ser Lys Asn Arg Asp Lys Asn 
340 345 350 

Phe Leu Lys Ser His Gly Arg Asp Asn Ser Arg Phe He Lys Lys Gin 
355 360 365 

Gly Leu Asp Arg Leu Phe His Glu Cys Asp Ser His Met Trp Arg Leu 
370 375 380 

Gly Glu Arg Gin He Pro Ala Gly He Val Val Asp Gly Gly Ser Asp 
385 390 395 400 

Trp Phe Val Leu Thr Arg Ser Phe Val Glu Tyr Val Val Tyr Thr Glu 
405 410 415 

Asp Pro Leu Val Ala Gin Leu Arg Gin Phe Tyr Thr Tyr Thr Leu Leu 
420 425 430 

Pro Ala Glu Ser Phe Phe His Thr Val Leu Glu Asn Ser Pro Ala Cys 
435 440 445 



Glu Ser Leu Val Asp Asn Asn Leu Arg Val Thr Asn Trp Asn Arg Lys 
450 455 460 



WO 01/49831 PCT/EP00/13311 

26 / 32 

Leu Gly Cys Lys Cys Gin Tyr Lys His lie Val Asp Trp Cys Gly Cys 
465 470 475 480 

Ser Pro Asn Asp Phe Lys Pro Gin Asp Phe Leu Arg Leu Gin Gin Val 
485 490 495 

Ser Arg Pro Thr Phe Phe Ala Arg Lys Phe Glu Ser Thr Val Asn Gin 
500 505 510 

Glu Val Leu Glu He Leu Asp Phe His Leu Tyr Gly Ser Tyr Pro Pro 
515 520 525 

Gly Thr Pro Ala Leu Lys Ala Tyr Trp Glu Asn He Tyr Asp Met Ala 
530 535 540 

Asp Gly Pro Ser Gly Leu Ser Asp Val Leu Leu Thr Ala Tyr Thr Ala 
545 550 555 ' 560 

Phe Ala Arg He Ser Leu Arg His Ala Ala Thr Val Ser Pro Leu Ala 
565 570 575 

Thr Ala Val Cys Arg Phe Glu Pro Arg Gly Leu Pro Ser Ser Val His 
580 585 590 

Leu Tyr Phe Tyr Asp Asp His Phe Gin Gly Tyr Leu Val Thr Gin Ala 
595 600 605 

Val Gin Pro Ser Ala Gin Gly Pro Ala Glu Thr Leu Glu Met Trp Leu 
610 615 620 

Met Pro Gin Arg Leu Leu Lys Pro Leu Gly His Ser Asp Gin Ala Ser 
625 630 635 640 

Arg Leu Gin Ser Leu Glu Val Gly Thr Glu Trp Asp Pro Lys Glu Arg 
645 650 655 

Leu Phe Arg Asn Phe Gly Gly Leu Leu Gly Pro Leu Asp Glu Pro Val 
660 665 670 

Ala Met Gin Arg Trp Ala Arg Gly Pro Asn Leu Thr Ala Thr Val Val 
675 680 685 

Trp He Asp Pro Thr Tyr Val Val Ala Thr Ser Tyr Asp He Thr Val 
690 695 700 

Asp Ala Asp Thr Glu Val Thr Gin Tyr Lys Pro Pro Leu Ser Leu Pro 
705 710 715 720 

Leu Arg Pro Gly Ala Trp Thr Val Arg Leu Leu Gin Phe Trp Glu Pro 
725 730 735 

Leu Gly Glu Thr Arg Phe Leu Val Leu Pro Leu Thr Phe Asn His Lys 
740 745 750 

Leu Pro Leu Arg Lys Asp Asp Ala Ser Trp Leu His Ala Gly Pro Pro 
755 760 765 

His Asn Glu Tyr Met Glu Gin Ser Phe Gin Gly Leu Ser Gly He Leu 
770 775 .780 



Asn Leu Pro Gin Pro Glu Ala Val Glu Glu Ala Ala Arg Arg His Thr 
785 790 795 800 
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Glu Leu Thr Gly Pro Ala Leu Glu Ala Trp Thr Asp Gly Glu Leu Ser 
805 810 815 

Ser Phe Trp Ser Val Ala Gly Leu Cys Ala He Gly Pro Ser Ser Cys 
820 825 830 

Pro Ser Leu Glu Leu Cys Arg Leu Thr Ser Trp Ser Ser Leu Ser Pro 
835 840 845 

Asp Pro Lys Ser Glu Leu Gly Pro Val Lys Ala Asp Gly Arg Leu Arg 
850 855 860 



<210> 9 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
<400> 9 

Cys Ser Arg Gin Lys Glu Leu Leu Lys Arg Lys Leu Glu Gin Gin Glu 
15 10 15 

Lys 



<210> 10 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
<400> 10 

Gin Glu Glu Glu Gly Ser Gly Gly Gly Gin Lys 
15 10 

<210> 11 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 
<400> 11 

Ser Glu Asn Glu Gly Ser Gly Met Ala Glu Gin Lys 
15 10 

<210> 12 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: peptide 



<400> 12 

Thr Glu Asn Glu Gly Ser Gly Leu Thr Asn He Lys 
15 10 
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<210> 13 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

gtgggtatgc agaagtgggg gaagggac 2B 



<210> 14 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 14 

ccctccgcaa tgcctaca 18 



<210> 15 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 15 

actgaggtca cgcaatacaa 20 



<210> 16 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

gccgcactca ggttgatgaa gaagt 25 



<210> 17 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 17 

accaccagga caaaggcgat tctga 25 



<210> 18 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 18 

agtcgaacag tccagggccc 

<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 19 

cacgatctcg cacttggggg 

<210> 20 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 20 

ctattcgatg atgaagatac cccaccaaac cc 



<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 21 

ggaagagctg ggtgtggaat 



<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 22 

tttcccgttg agatcctgct 



<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 23 

acagacagca acaacgagaa 
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<210> 24 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 24 

aaaggaaggc agaggaagc 

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 25 

acccctccac tgtctgtaag 

<210> 26 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 26 

Glu Leu Gly Ala Lys 
1 5 

<210> 27 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 27 
Glu Leu Leu Lys 
1 

<210> 28 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 



<400> 28 

Asp Met Asn Phe Leu Lys 
1 5 
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<210> 29 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 29 

lie Ala Ser Pro Pro Ser Asp Phe Gly Arg 
15 10 

<210> 30 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 30 
Leu Leu Leu Asp 
1 

<210> 31 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 31 

Asp Phe Glu Asn Val Asp Asn Ser Asn Phe Ala Pro Arg 
15 10 

<210> 32 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 32 

Pro Thr Phe Phe Ala Arg 
1 5 

<210> 33 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 33 

Leu Gin Phe Ser Glu Val Gly Thr Asp Xaa Asp Ala 
15 10 



WO 01/49831 



PCT/EP00/13311 



32 / 32 

<210> 34 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 34 

Glu Leu Gly Ala Val Lys Pro Asp Gly Arg Leu 
15 10 

<210> 35 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 35 

Glu Leu Leu Lys Arg Lys Leu Glu Gin Gin Glu Lys 
15 10 

<210> 36 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: digestion 
peptide 

<400> 36 

Leu Gly Leu Leu Met Pro Glu Lys 
1 5 
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